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PROTEIN BIOM ARKERS THAT DISTINGUISH 
PROSTATE CANCER FROM NON-MALIGNANT CELLS 

CROSS-REFERENCES TO RELATED APPLICATIONS 

5 This application claims the benefit of U.S. Provisional Application 

No.60/358,986 filed February 21, 2002, which is incorporated herein by reference. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 
FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 

This invention was made with government support under grant no. UOl 
1 0 CA85067 awarded by the National Cancer Institute. The government may have certain 
rights in this invention. 

BACKGROUND OF THE INVENTION 

Prostate cancer is the most common form of cancer in males. It typically 
afflicts aging males, but it can afflict males of all ages. A significant number of males die 

1 5 from prostate cancer every year, and it is the second leading cause of cancer deaths in 
men. Early diagnosis of prostate cancer in patients reduces the likelihood of death. 

The effectiveness of any diagnostic test depends upon its specificity and 
selectivity. That is, what is the relative ratio of true positive diagnoses, true negative 
diagnoses, false positive diagnoses and false negative diagnoses? Methods of increasing 

20 the percentage of true positive and true negative diagnoses for any condition are desirable 
medical goals. In the case of prostate cancer, the present diagnostic tests are not 
completely satisfactory, in that they provide significant numbers of false positive and 
false negative results. 

Conventionally, prostate cancer is diagnosed using prostate specific 

25 antigen (PSA) as a marker. In general, PSA levels above 4 ng/ml are suggestive of 
prostate cancer while levels above 10 ng/ml are highly suggestive of prostate cancer. 
However, if the cancer is in its early stages, some prostate cancer patients exhibit normal 
PSA levels at the time of diagnosis. Since conventional PSA tests detect abnormal levels 
of PSA, conventional PSA tests may not be able to detect the presence of prostate cancer 

30 if it is in its early stages. This results in a false negative diagnosis. The inability of 
conventional PSA tests to diagnose the presence of prostate cancer in some instances 
(e.g., in the early stages of the disease) can be detrimental to the patient. Moreover, many 
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individuals with elevated levels of PSA in the blood serum may not have prostate cancer, 
but may instead have benign prostate hyperplasia (BPH) {i.e., a benign tumor) or prostate 
intraepithelial neoplasia (PIN). This results in a false positive diagnosis. In order to 
determine if a person has prostate cancer, rather than BPH or PIN, additional 
5 immunoassays using other antibodies and/or biopsies of the prostate tissue are performed. 
These additional tests are time consuming and expensive for both patients and their care 
providers. 

There is some consensus in the medical community that better diagnosis 
will result from the discovery of more disease markers that can be used alone or in 
10 combination to increase the specificity and selectivity of diagnostic tests. 

SUMMARY OF THE INVENTION 

Several organic biomolecules (e.g\, proteins, glycoproteins, nucleoproteins 
and the like) have been discovered that possess molecular weights capable of functioning 
as markers in prostate cancer ("PCA"), prostate intraepithelial neoplasia ("PIN"), or 

1 5 benign prostate hyperplasia ("BPH") versus a negative diagnosis. Compared to a 

negative diagnosis, the markers are, variously, more frequently detected, less frequently 
detected, or differentially detected. The detection and amount (either absolute or relative) 
of these markers, alone or in combination, in patient samples provides information that 
the diagnostician can correlate with probable diagnoses of prostate cancer, benign 

20 prostate hyperplasia or with a negative diagnosis (e.g., normal or disease-free). Each 
marker can be distinguished from other biomolecular masses as the source from which 
the markers are derived is controlled and/or particular purification procedures are 
employed that minimize the potential of false positives, as described herein. As is also 
described herein, there are several methods that allow resolution and detection of the 

25 marker molecular weights. In preferred embodiments, the method of resolution involves 
Surface-Enhanced Laser Desorption/Ionization mass spectrometry, in which the surface 
of the mass spectrometry probe plays an active role in the desorption and ionization of the 
analyte. 

A set of markers was identified in prostate epithelial cell lysates derived 
30 from laser capture microdissection (LCM). The Marker Set includes the following: 
Marker EP1: 3448 ± 19 Da, Marker EP2: 4036 ± 22 Da, Marker EP3: 4361 ± 24 Da, 
Marker EP4: 4413 ± 24 Da, Marker EP5: 4639 ± 26 Da, Marker EP6: 4749 ± 26 Da, 
Marker EP7: 4827± 27 Da, Marker EP8: 5666 ± 31 Da, Marker EP9: 8445 ± 46 Da, 
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Marker EP10: 11744 ±65 Da, Marker EP 11: 14696 ±81 Da, Marker EP12: 24184 ± 133 
Da, Marker EP13: 48308 ± 266 Da, and Marker EP14: 53830 ± 296 Da. These markers 
are further characterized by the ability to bind to a metal chelate adsorbent and washed 
with an eluant at neutral pH. This implies that they contain metal binding residues. 
5 Markers EP2, EP3, EP5, EP6 and EP12 occur in a greater frequency and abundance in the 
PIN and PCA cell lysates. Markers EP4, EP8, EP13, and EP 14 appear in greater 
abundance in normal and BPH cell lysates. Marker EP10 had a decreased abundance in 
BPH, PIN, and PCA cell lysates. Markers EP5 and EP12 are found only in PCA cell 
lysates. 

1 0 The in addition to mass characteristics, the markers may also be 

characterized by affinity characteristics. For example, the molecular weight and metal 
binding characteristics are provided herein. We point out that molecular weight and 
metal binding are characteristic properties of these markers and not limitations on means 
of detection or isolation. Furthermore, once partial amino acid sequences are obtained, 

1 5 each marker can be identified unambiguously by matching the sequence in a protein 
database. 

One aspect of the invention is a method of qualifying a prostate cancer 
status in a subject comprising: a) measuring at least one marker in a sample from the 
subject, wherein the marker is selected from the group consisting of those markers noted 

20 above as being identified in prostate epithelial cell lysates and combinations thereof; and, 
b) correlating the measurement with prostate cancer status. 

In one embodiment of this method, the prostate cancer status is selected 
from the group consisting of prostate cancer (PCA), prostate intraepithelial neoplasia 
(PIN), and benign prostate hyperplasia (BPH). In another embodiment the sample is 

25 prostate tissue extract. Still another embodiment is where measuring comprises 

determining the mass of the protein, which an be by mass spectrometry, preferably is gas 
phase ion spectrometry, more preferably laser desorption ionization mass spectrometry. 

Samples used in the method may be selected from the group consisting of 
blood, serum, urine, prostatic fluid, seminal fluid, semen, and prostate tissue. 

30 Measuring marker amount may further comprise i) fractionating the 

sample; ii) binding a fraction of the sample to an adsorbent; and, iii) comprises detecting 
the protein marker by gas phase ion spectrometry. Alternatively, marker amount may be 
detected by immunoassay. 
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The absorbent used in the methods may be selected from the group 
consisting of a hydrophilic adsorbent, a metal chelate adsorbent, and a strong anion 
exchange adsorbent. 

In another embodiment of the method, measuring the biomarker(s) 
5 comprises i) embedding a portion of a tissue specimen harvested from a patient in OCT 
and freezing the specimen; ii) obtaining cryosections from the tissue specimen; iii) 
obtaining cell samples from the cryosections by laser capture microdissection; iv) mixing 
cell samples from step (d) with lysis buffer thereby producing cell lysates; v) diluting and 
vortexing the cell lysates; vi) centrifuging the vortexed cell lysates thereby producing a 

10 supernatant fraction; vii) binding the supernatant fraction to an adsorbent; and, viii) 
comprising detecting the protein marker using gas phase ion spectrometry. The 
adsorbent used may be selected from the group consisting of a hydrophilic adsorbent, a 
metal chelate adsorbent, and a strong anion exchange adsorbent. The sample may 
comprise any number of marker combinations, for example simply marker EP8, markers 

1 5 EP2 and EP3, EP2 and EP5, markers EP3 and EP5, markers EP2 and EP6, markers EP3 
and EP6, markers EP5 and EP6, markers EP2, EP3 and EP5, markers EP2, EP3 and EP6, 
or markers EP2, EP3, EP4, EP5, EP6, and EP8. 

In one embodiment, correlating the measurement with prostate cancer 
status includes: i) generating data for each protein marker with the mass spectrometer, the 

20 data comprising a mass/charge ratio and an amount determination for each ion 

corresponding to each protein marker; ii) transforming the data into computer-readable 
form; and, iii) executing an algorithm with a programmable digital computer, wherein the 
algorithm determines closeness-of-fit between the computer-readable data and a data set 
indicating a diagnosis of PCA, PIN, BPH or a negative diagnosis. 

25 In another aspect this invention provides a method for aiding in a diagnosis 

of prostate cancer, prostate intraepithelial neoplasia, or benign prostate hyperplasia. The 
method involves: a) detecting at least one protein marker selected from the Marker Set (as 
defined herein) in a sample from a subject; and b) correlating the detection of the marker 
or markers with a probable diagnosis of prostate cancer, prostate intraepithelial neoplasia, 

30 benign prostate hyperplasia or a negative diagnosis, wherein the correlation takes into 
account the relative detectability of the marker or markers in each diagnosis. 

In certain embodiments, the sample may be selected from seminal plasma, 
blood, serum, urine, prostatic fluid, seminal fluid, semen, and prostate tissue. In the 
preferred embodiment, the marker or markers is detected in cell lysates from prostate 
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tissue. In other embodiments, the marker or markers are detected by gas phase ion 
spectrometry or, more particularly, laser desorption mass spectrometry. In another 
embodiment, the marker or markers are detected by immunoassay. In other 
embodiments, the method comprises detecting a plurality of the markers. In other 
5 embodiments, the method involves detecting at least marker EP2 and a second marker 
selected from EP3, EP5, or EP6. In another embodiment, the method comprises detecting 
at least marker EP3 and a second marker selected from EP5 or EP6. In another 
embodiment, the method comprises detecting at least marker EP5 and EP6. In another 
embodiment, the method comprises detecting at least markers EP2, EP3 and EP5. In 

10 another embodiment, the method comprises detecting at least markers EP2, EP3 and EP6. 
In another embodiment the method comprises: i) generating data on the sample with the 
mass spectrometer indicating intensity of signal for mass/charge ratio, ii) transforming the 
data into computer-readable form; and iii) executing an algorithm with a programmable 
digital computer, wherein the algorithm detects signal in the computer-readable data and 

1 5 wherein detecting the markers is indicative of a diagnosis of PCA, PIN, BPH or a 
negative diagnosis 

In another aspect this invention provides a method for detecting at least 
one protein marker from the Marker Set in a sample, wherein the method comprises 
detecting the marker or markers by gas phase ion spectrometry. 

20 In one embodiment the method comprises: i) generating data on the 

sample with a mass spectrometer indicating intensity of signal for mass/charge ratio, ii) 
transforming the data into computer-readable form; and iii) executing an algorithm with a 
programmable digital computer wherein the algorithm detects signal in the computer- 
readable data representing the marker or markers and wherein detecting the markers is 

25 indicative of a diagnosis of PCA, PIN, BPH or a negative diagnosis. 

In another aspect of this invention, a marker or multiple markers are 
detected in samples and an artificial intelligence program such as fuzzy logic, cluster 
analysis, or neural network are used to analyze the data. 

In another embodiment the method further comprises i) before detecting 

30 the marker or markers, fractionating a sample comprising the marker or markers by 

contacting the sample with a substrate comprising an adsorbent that retains the marker or 
markers and removing unretained sample; and ii) desorbing and ionizing the retained 
markers from the adsorbent during mass spectrometry. In certain embodiments of this 
method the substrate is a mass spectrometer probe comprising the adsorbent on a probe 
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surface. In another embodiment, the substrate is a resin, and, after fractionating the 
sample, the resin with the marker or markers retained by the adsorbent is placed on a 
mass spectrometer probe for desorption and ionization by the mass spectrometer. In 
another embodiment the adsorbent is selected from a hydrophilic adsorbent, e.g., an 
5 anionic adsorbent, and a metal chelate adsorbent, e.g., a copper chelate adsorbent. 

In one embodiment involving laser desorption/ionization mass 
spectrometry the method comprises: i) providing a probe adapted for use with a mass 
spectrometer comprising an adsorbent attached thereto; ii) contacting the marker or 
markers with the adsorbent; iii) desorbing and ionizing the marker or markers from the 

10 probe; and, iv) detecting the desorbed/ionized marker or markers with the mass 

spectrometer. In another embodiment the method comprises: i) providing a substrate 
comprising an adsorbent attached thereto; ii) contacting the marker or markers with the 
adsorbent; iii) placing the substrate on a probe adapted for use with a mass spectrometer 
comprising an adsorbent attached thereto; iv) desorbing and ionizing the marker or 

1 5 markers from the probe; and, v) detecting the desorbed/ionized marker or markers with 
the mass spectrometer. In other embodiments the adsorbent is a hydrophilic adsorbent 
(e.g., silicon oxide) or a metal chelate adsorbent (e.g., copper chelate). In another 
embodiment, the adsorbent comprises an antibody that specifically binds to the marker. 

In another aspect the invention provides a purified protein selected from 

20 the protein markers of the Marker Set. In one embodiment, the purified protein is 
produced by a process comprising fractionating a sample comprising the marker or 
markers by size exclusion chromatography and/or anion exchange chromatography, then 
collecting a fraction that includes the marker or markers. 

In another aspect this invention provides a kit comprising: (1) an adsorbent 

25 attached to a substrate, wherein the adsorbent retains a protein marker selected from the 
Marker Set; and (2) instructions to detect the marker or markers by contacting a sample 
with the adsorbent and detecting the marker or markers retained by the adsorbent. In one 
embodiment of this invention the substrate is a probe for a gas phase ion spectrometer 
having a surface on which the adsorbent is attached. The adsorbent can be, e.g., a metal 

30 chelating adsorbent (e.g. , MAC3 Cull chip). In another embodiment the kit further 
comprises (1) an eluant wherein the marker or markers are retained on the adsorbent 
when washed with the eluent, or (2) instructions for washing the adsorbent with the eluent 
after contacting the adsorbent with the marker or markers. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1: Representative SELDI spectra of one LCM sample from PCA 
epithelial cells in m/z range of (a) 3000-10,000, (b) 10,000 to 20,000 and (c) 20,000 to 
80,000. An average of 70 peaks were detected for each cell lysate analyzed. 
5 Figure 2a: Representative spectra and gel-view of matched prostate 

epithelial cell lysates showing protein alterations in the mass/charge(m/z) range of 4000- 
6000. (arrows indicate over-expressed peaks in PIN and PCA.) 

Figure 2b: SELDI gelview protein profiles of lysates prepared from cells 
procured from different prostatectomy specimens. The red box identifies a peak with an 
10 average mass of 4749 Da that appears to be more abundant in PIN and PCA epithelial 
cells. Replicate samples(*) show good reproducibility of the protein patterns. The 
different protein patterns observed for two different PCA foci in Patient #2 and the two 
PIN foci in Patient #3, may be the result of genetic heterogeneity 

Figure 2c: Representative spectra and gel-views of matched prostate 
1 5 epithelial cell lysates showing increased abundance of a peak at m/z of 5666 in the BPH 
sample. 

Figure 3: Example of protein peaks found between 20,000-70,000 Da. 
Differential expression of a peak at 28400 Da, thought to be PSA was observed in 
epithelial cell lysates, but absent in lysates from adjacent stromal cells. The intracellular 
20 free PSA appears to be under-expressed in the PIN and PCA epithelial cell lysates. The 
serum albumin peak at 66800 Da was present in each sample. 

Figure 4: Expression levels of peptide/proteins compared to the expression 
levels in matched normal epithelial cells. The average expression in normal cell lysates 
was subtracted from the expression levels found in the other cell types. Upward bars 
25 indicate over-expression, downward bars indicate under-expression. 

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

I. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used herein 
30 have the meaning commonly understood by a person skilled in the art to which this 

invention belongs. The following references provide one of skill with a general definition 
of many of the terms used in this invention: Singleton et al. , Dictionary of Microbiology 

7 
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and Molecular Biology (2nd ed. 1994); The Cambridge Dictionary of Science and 
Technology (Walker ed., 1988); The Glossary of Genetics, 5th Ed., R. Rieger et al. (eds.), 
Springer Verlag (1991); and Hale & Marham, The Harper Collins Dictionary of Biology 
(1991). As used herein, the following terms have the meanings ascribed to them unless 
5 specified otherwise. 

"Marker" in the context of the present invention refers to an organic 
biomolecule, particularly a polypeptide, which is differentially present in a sample taken 
from patients having prostate cancer, prostate intraepithelial neoplasia, or benign prostate 
hyperplasia as compared to a comparable sample taken from subjects who do not have 

1 0 prostate cancer {e.g. , negative diagnosis, normal or healthy subject). For example, a 
marker can be a polypeptide (having a particular apparent molecular weight) which is 
present at an elevated or decreased level in samples of prostate cancer patients compared 
to samples of patients with a negative diagnosis. 

"Organic biomolecule" refers to an organic molecule of biological origin, 

1 5 e.g., steroids, amino acids, nucleotides, sugars, polypeptides, polynucleotides, complex 
carbohydrates or lipids. 

The phrase "differentially present" refers to differences in the quantity of a 
polypeptide (of a particular apparent molecular weight) present in a sample taken from 
patients having prostate cancer as compared to a comparable sample taken from patients 

20 who do not have prostate cancer {e.g. , have benign prostate hyperplasia). A polypeptide 
is differentially present between the two samples if the amount of the polypeptide in one 
sample is significantly different from the amount of the polypeptide in the other sample. 
For example, a polypeptide is differentially present between the two samples if it is 
present in an amount (e.g., concentration, mass, molar amount, etc.) at least about 150%, 

25 at least about 200%, at least about 500% or at least about 1 000% greater than it is present 
in the other sample, or if it is detectable in one sample and not detectable in the other. 
Any polypeptides that are differentially present in samples taken from prostate cancer 
patients as compared to subjects who do not have prostate cancer {e.g., benign prostate 
hyperplasia patients) can be used as markers. 

30 "Diagnostic" means identifying the presence or nature of a pathologic 

condition. Diagnostic methods differ in their sensitivity and specificity. "Sensitivity" is 
defined as the percent of diseased (Benign Prostate Hyperplasia (BPH)/Prostate 
Intraepithelial Neoplasia (PIN)/Prostate Adenocarcinoma (PCA)) cell types for which the 
biomarker of interest is present, (true positive/total number of diseased lysates x 100). 
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"Specificity" is defined as the percent of cell types for which the biomarker of interest is 
not positive, (true negative/total number of lysates without disease * 100). Diseased 
individuals not detected by the assay are "false negatives." Subjects who are not diseased 
and who test negative in the assay, are termed "true negatives." While a particular 
5 diagnostic method may not provide a definitive diagnosis of a condition, it suffices if the 
method provides a positive indication that aids in diagnosis. 

A "test amount" of a marker refers to an amount of a marker present in a 
sample being tested. A test amount can be either in absolute amount (e.g., |ag/ml) or a 
relative amount (e.g., relative intensity of signals). 

10 A "diagnostic amount" of a marker refers to an amount of a marker in a 

subject's sample that is consistent with a diagnosis of prostate cancer. A diagnostic 
amount can be either in absolute amount (e.g., ng/ml) or a relative amount (e.g., relative 
intensity of signals). 

A "control amount" of a marker can be any amount or a range of amount 

1 5 which is to be compared against a test amount of a marker. For example, a control 
amount of a marker can be the amount of a marker (e.g., seminal basic protein) in a 
prostate cancer patient, a BPH patient or a person without prostate cancer or BPH. A 
control amount can be either in absolute amount (e.g., p.g/ml) or a relative amount (e.g., 
relative intensity of signals). 

20 "Probe" refers to a device that, when positionally engaged in an 

interrogatable relationship to an ionization source, e.g., a laser desorption/ionization 
source, and in concurrent communication at atmospheric or sub-atmospheric pressure 
with a detector of a gas phase ion spectrometer, can be used to introduce ions derived 
from an analyte into the spectrometer. As used herein, the "probe" is typically reversibly 

25 engageable (e.g., removably insertable) with a probe interface that positions the probe in 
an interrogatable relationship with the ionization source and in communication with the 
detector . A probe will generally comprise a substrate comprising a sample presenting 
surface on which an analyte is presented to the ionization source. 

"Substrate" or "probe substrate" refers to a solid phase onto which an 

30 adsorbent can be provided (e.g., by attachment, deposition, etc.) 

"Ionization source" refers to a device that directs ionizing energy to a 
sample presenting surface of a probe to desorb and ionize analytes from the probe surface 
into the gas phase. The preferred ionization source is a laser (used in laser 
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desorption/ionization), in particular, nitrogen lasers, Nd-Yag lasers and other pulsed laser 
sources. Other ionization sources include fast atoms (used in fast atom bombardment), 
plasma energy (used in plasma desorption) and primary ions generating secondary ions 
(used in secondary ion mass spectrometry). 
5 "Gas phase ion spectrometer" refers to an apparatus that detects gas phase 

ions. In the context of this invention, gas phase ion spectrometers include an ionization 
source used to generate the gas phase ions. Gas phase ion spectrometers include, for 
example, mass spectrometers, ion mobility spectrometers, and total ion current measuring 
devices. 

1 0 "Gas phase ion spectrometry" refers to a method comprising employing an 

ionization source to generate gas phase ions from an analyte presented on a sample 
presenting surface of a probe and detecting the gas phase ions with a gas phase ion 
spectrometer. 

"Mass spectrometer" refers to a gas phase ion spectrometer that measures 

15 a parameter which can be translated into mass-to-charge ratios of gas phase ions. Mass 
spectrometers generally include an inlet system, an ionization source, an ion optic 
assembly, a mass analyzer, and a detector. Examples of mass spectrometers are time-of- 
flight, magnetic sector, quadrapole filter, ion trap, ion cyclotron resonance, electrostatic 
sector analyzer and hybrids of these. 

20 "Mass spectrometry" refers to a method comprising employing an 

ionization source to generate gas phase ions from an analyte presented on a sample 
presenting surface of a probe and detecting the gas phase ions with a mass spectrometer. 

"Laser desorption mass spectrometer" refers to a mass spectrometer which 
uses laser as a means to desorb, volatilize and ionize an analyte. 

25 "Adsorbent" refers to any material capable of adsorbing a marker. The 

term "adsorbent" is used herein to refer both to a single material ("monoplex adsorbent") 
(e.g., a compound or functional group) to which the marker is exposed, and to a plurality 
of different materials ("multiplex adsorbent") to which the marker is exposed. The 
adsorbent materials in a multiplex adsorbent are referred to as "adsorbent species." For 

30 example, an addressable location on a probe substrate can comprise a multiplex adsorbent 
characterized by many different adsorbent species (e.g., anion exchange materials, metal 
chelators, or antibodies), having different binding characteristics. Substrate material itself 
can also contribute to adsorbing a marker and may be considered part of an "adsorbent." 



10 
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"Adsorption" or "retention" refers to the detectable binding between an 
absorbent and a marker either before or after washing with an eluant (selectivity threshold 
modifier) or a washing solution. 

"Eluant" or "washing solution" refers to an agent that can be used to 
5 mediate adsorption of a marker to an adsorbent. Eluants and washing solutions also are 
referred to as "selectivity threshold modifiers." Eluants and washing solutions can be 
used to wash and remove unbound materials from the probe substrate surface. 

"Resolve," "resolution," or "resolution of marker" refers to the detection 
of at least one marker in a sample. Resolution includes the detection of a plurality of 
1 0 markers in a sample by separation and subsequent differential detection. Resolution does 
not require the complete separation of a marker from all other markers in a mixture. 
Rather, any separation that allows the distinction between at least two markers suffices. 

"Detect" refers to identifying the presence, absence or amount of the 
object to be detected. 

1 5 The terms "polypeptide," "peptide" and "protein" are used interchangeably 

herein to refer to a polymer of amino acid residues. The terms apply to amino acid 
polymers in which one or more amino acid residue is an analog or mimetic of a 
corresponding naturally occurring amino acid, as well as to naturally occurring amino 
acid polymers. Polypeptides can be modified, e.g., by the addition of carbohydrate 

20 residues to form glycoproteins. The terms "polypeptide," "peptide" and "protein" include 
glycoproteins, as well as non-glycoproteins. 

"Detectable moiety" or a "label" refers to a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, or chemical means. For 
example, useful labels include 32 P, 35 S, fluorescent dyes, electron-dense reagents, 

25 enzymes (e.g. , as commonly used in an ELISA), biotin-streptavidin, digoxigenin, haptens 
and proteins for which antisera or monoclonal antibodies are available, or nucleic acid 
molecules with a sequence complementary to a target. The detectable moiety often 
generates a measurable signal, such as a radioactive, chromogenic, or fluorescent signal, 
that can be used to quantify the amount of bound detectable moiety in a sample. The 

30 detectable moiety can be incorporated in or attached to a primer or probe either 

covalently, or through ionic, van der Waals or hydrogen bonds, e.g., incorporation of 
radioactive nucleotides, or biotinylated nucleotides that are recognized by streptavidin. 
The detectable moiety may be directly or indirectly detectable. Indirect detection can 
involve the binding of a second directly or indirectly detectable moiety to the detectable 
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moiety. For example, the detectable moiety can be the ligand of a binding partner, such 
as biotin, which is a binding partner for streptavidin, or a nucleotide sequence, which is 
the binding partner for a complementary sequence, to which it can specifically hybridize. 
The binding partner may itself be directly detectable, for example, an antibody may be 
5 itself labeled with a fluorescent molecule. The binding partner also may be indirectly 
detectable, for example, a nucleic acid having a complementary nucleotide sequence can 
be a part of a branched DNA molecule that is in turn detectable through hybridization 
with other labeled nucleic acid molecules. (See, e.g., P. D. Fahrlander and A. Klausner, 
Bio/Technology 6:1165 (1988)). Quantitation of the signal is achieved by, e.g., 

1 0 scintillation counting, densitometry, or flow cytometry. 

"Measure" in all of its grammatical forms, refers to detecting, quantifying 
or qualifying the amount (including molar amount), concentration or mass of a physical 
entity or chemical composition either in absolute terms in the case of quantifying, or in 
terms relative to a comparable physical entity or chemical composition. 

1 5 "Antibody" refers to a polypeptide ligand substantially encoded by an 

immunoglobulin gene or immunoglobulin genes, or fragments thereof, which specifically 
binds and recognizes an epitope (e.g., an antigen). The recognized immunoglobulin 
genes include the kappa and lambda light chain constant region genes, the alpha, gamma, 
delta, epsilon and mu heavy chain constant region genes, and the myriad immunoglobulin 

20 variable region genes. Antibodies exist, e.g., as intact immunoglobulins or as a number 
of well characterized fragments produced by digestion with various peptidases. This 
includes, e.g., Fab' and F(ab)' 2 fragments. The term "antibody," as used herein, also 
includes antibody fragments either produced by the modification of whole antibodies or 
those synthesized de novo using recombinant DNA methodologies. It also includes 

25 polyclonal antibodies, monoclonal antibodies, chimeric antibodies, humanized antibodies, 
or single chain antibodies. "Fc" portion of an antibody refers to that portion of an 
immunoglobulin heavy chain that comprises one or more heavy chain constant region 
domains, CHI, CH2 and CH3, but does not include the heavy chain variable region. 

"Immunoassay" is an assay that uses an antibody to specifically bind an 

30 antigen. The immunoassay is characterized by the use of specific binding properties of a 
particular antibody to isolate, target, and/or quantify the antigen. 

The phrase "specifically (or selectively) binds" to an antibody or 
"specifically (or selectively) immunoreactive with," when referring to a protein or 
peptide, refers to a binding reaction that is determinative of the presence of the protein in 
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a heterogeneous population of proteins and other biologies. Thus, under designated 
immunoassay conditions, the specified antibodies bind to a particular protein at least two 
times the background and do not substantially bind in a significant amount to other 
proteins present in the sample. Specific binding to an antibody under such conditions 
5 may require an antibody that is selected for its specificity for a particular protein. For 
example, polyclonal antibodies raised to seminal basic protein from specific species such 
as rat, mouse, or human can be selected to obtain only those polyclonal antibodies that are 
specifically immunoreactive with seminal basic protein and not with other proteins, 
except for polymorphic variants and alleles of seminal basic protein. This selection may 

1 0 be achieved by subtracting out antibodies that cross-react with seminal basic protein 
molecules from other species. A variety of immunoassay formats may be used to select 
antibodies specifically immunoreactive with a particular protein. For example, solid- 
phase ELISA immunoassays are routinely used to select antibodies specifically 
immunoreactive with a protein {see, e.g. , Harlow & Lane, Antibodies, A Laboratory 

1 5 Manual (1 988), for a description of immunoassay formats and conditions that can be used 
to determine specific immunoreactivity). Typically a specific or selective reaction will be 
at least twice background signal or noise and more typically more than 1 0 to 1 00 times 
background. 

II. GAS PHASE ION SPECTROMETRY 

20 In one aspect, the invention provides methods for detecting markers which 

are differentially present in samples from patients with prostate cancer, prostate 
intraepithelial neoplasia, benign prostate hyperplasia and with negative diagnoses. Any 
one or combination of markers described are within the scope of this aspect of this 
invention and can be detected. The methods for detecting these markers have many 

25 applications. For example, one marker or combination of markers can be measured to 
differentiate between PCA or PIN and BPH, and thus are useful as an aid in the diagnosis 
of prostate cancer in a patient. In another example, the present methods for detecting 
these markers can be applied to in vitro prostate cancer cells or in vivo animal models for 
prostate cancer to assay for and identify compounds that modulate expression of these 

30 markers. In one embodiment of the detection method, the marker or markers are detected 
by gas phase ion spectrometry. In a preferred embodiment, the marker or markers are 
detected by mass spectrometry and, in particular, laser desorption mass spectrometry. 
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Probes suitable for use in the invention are described in, e.g., U.S. Patent 
Nos. 5,617,060 (Hutchens and Yip) and 6,225,047 (Hutchens and Yip). In one 
embodiment, a substrate comprising an adsorbent can be in the form of a probe, which is 
removably insertable into a gas phase ion spectrometer. For example, a substrate can be 
5 in the form of a strip with adsorbents on its surface. In another embodiment, a substrate 
comprising an adsorbent can be positioned onto another substrate to form a probe, which 
is removably insertable into a gas phase ion spectrometer. For example, a substrate 
comprising an adsorbent can be a solid phase, such as a polymeric or glass bead with a 
functional group for binding a marker, which can be subsequently positioned on a second 

10 substrate to form a probe. For example, the second substrate can be in the form of a strip, 
or a plate having a series of wells at a predetermined addressable locations. One 
advantage of this embodiment is that the marker can be adsorbed to the first substrate in 
one physical context, and transferred to the second substrate, which can then be submitted 
for analysis by gas phase ion spectrometry. The probe can be in any shape as long as it is 

1 5 removably insertable into a gas phase ion spectrometer. 

The probe can also be adapted for use with inlet systems and detectors of a 
gas phase ion spectrometer. For example, the probe can be adapted for mounting in a 
horizontally, vertically and/or rotationally translatable carriage that moves the probe to a 
successive position without requiring repositioning of the probe by hand. 

20 The probe substrate is preferably made of a material that is capable of 

supporting adsorbents. For example, the probe substrate material can include, but is not 
limited to, insulating materials (e.g., glass, ceramic), semi-insulating materials {e.g., 
silicon wafers), or electrically conducting materials (e.g., metals, such as copper, brass, 
steel, aluminum, gold, or electrically conductive polymers), organic polymers, 

25 biopolymers, or any combinations thereof. 

The probe substrate surface can be conditioned to bind markers. For 
example, in one embodiment, the surface of the probe substrate can be conditioned (e.g., 
chemically or mechanically such as roughening) to place adsorbents on the surface. The 
adsorbent comprises functional groups for binding with a marker. In some embodiments, 

30 the substrate material itself can also contribute to adsorbent properties and may be 
considered part of an "adsorbent." 

Any number of different adsorbents can be used as long as they have 
binding characteristics suitable for binding the markers of the present invention. The 
adsorbents can comprise a hydrophobic group, a hydrophilic group, a cationic group, an 
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anionic group, a metal ion chelating group, or antibodies which specifically bind to 
antigens, or a combination thereof (sometimes referred to as "a mixed mode" adsorbent). 
Exemplary adsorbents comprising a hydrophobic group include matrices having aliphatic 
hydrocarbons, e.g., Q-C18 aliphatic hydrocarbons and matrices having aromatic 
5 hydrocarbon functional group such as phenyl groups. Exemplary adsorbents comprising 
a hydrophilic group include silicon oxide (i.e., glass), or hydrophilic polymers such as 
polyethylene glycol, dextran, agarose, or cellulose. Exemplary adsorbents comprising a 
cationic group include matrices of secondary, tertiary or quaternary amines. Exemplary 
adsorbents comprising an anionic group include matrices of sulfate anions (S0 3 ") and 

10 matrices of carboxylate anions (i.e., COO") or phosphate anions (OP0 3 ')- Exemplary 

adsorbents comprising metal chelating groups include organic molecules that have one or 
more electron donor groups which form coordinate covalent bonds with metal ions, such 
as copper, nickel, cobalt, zinc, iron, and other metal ions such as aluminum and calcium. 
Exemplary adsorbents comprising an antibody include antibodies that are specific for any 

15 one of the markers provided herein. In preferred embodiments, adsorbents are 

substantially similar to or the same as the adsorbents which were used to enrich and 
identify the markers. 

One useful chip is the Normal Phase ProteinChip® array, available from 
Ciphergen Biosystems, Inc. (Palo Alto, CA). The normal phase chip has a hydrophilic 

20 adsorbent comprising silicon oxide (Si0 2 ) on the substrate surface. Silicon oxide can be 
applied to the surface by any of a number of well known methods. These methods 
include, for example, vapor deposition, e.g., sputter coating. A preferred thickness for 
such a probe is about 9000 Angstroms. 

Another useful chip is the SAX1 ProteinChip® array made by Ciphergen 

25 Biosystems, Inc. in Palo Alto, CA. The SAX1 protein chips are fabricated from Si02 
coated aluminum substrates. In the process, a suspension of quaternary ammonium 
polystryenemicrospheres in distilled water is deposited onto the surface of the chip (1 
mL/spot, two times). After air drying (room temperature, 5 minutes), the chip is rinsed 
with deionized water and air dried again (room temperature, 5 minutes). 

30 Another useful chip is the IMAC3 (Immobilized Metal Affinity Capture, 

nitrilotriacetic acid on surface) ProteinChip® array, also available from Ciphergen 
Biosystems, Inc. The chips are produced as follows: 5-Methacylamido-2-(N,N- 
biscarboxymemaylamino)pentanoic acid (7.5 wt %),Acryloyltri- 
(hydroxymemyl)memylamine (7.5 wt %) and NJ^-methylenebisacrylamide (0.4 wt %) 
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are photo-polymerized using-(-)riboflavin (0.02 wt %) as a photo-initiator. The monomer 
solution is deposited onto a rough etched, glass coated substrate (0.4 mL, twice) and 
irradiated for 5 minutes with a near UV exposure system (Hg short arc lamp, 20 mW/cm 2 
at 365 nm). The surface is washed with a solution of sodium chloride (1 M) and then 
5 washed twice with deionized water. 

The IMAC3 with Cu(IT) is activated as follows. The surface is treated 
with a solution of CuS0 4 (100 mM, 10 uL/spot) and mixed on a high frequency mixer for 
1 0 minutes. After removing the CuS0 4 solution, the chips are rinsed with water and each 
spot is treated with a solution of sodium acetate (100 mM, pH 4.0, 10 pi per spot). The 
10 sodium acetate is removed and the surface is washed with a stream of deionized water (15 
sec/chip). 

Adsorbents can be placed on the probe substrate in continuous or 
discontinuous patterns. If continuous, one or more adsorbents can be placed on the 
substrate surface. If multiple types of adsorbents are used, the substrate surface can be 

15 coated such that one or more binding characteristics vary in one or two-dimensional 

gradient. If discontinuous, plural adsorbents can be placed in predetermined addressable 
locations on the substrate surface. The addressable locations can be arranged in any 
pattern, but are preferably in regular pattern, such as lines, orthogonal arrays, or regular 
curves {e.g., circles). Each addressable location may comprise the same or different 

20 adsorbent. The spots are "addressable" in that during mass spectrometry an energy 
source, such as a laser, may be directed to (i.e., "addresses") each spot differentially to 
desorb the analyte. 

The probes can be produced using any suitable methods depending on the 
selection of substrate materials and/or adsorbents. For example, the surface of a metal 

25 substrate can be coated with a material that allows derivitization of the metal surface. 
More specifically, a metal surface can be coated with silicon oxide, titanium oxide or 
gold. Then surface can be derivatized with a bifunctional linker, one end of which can 
covalently bind with a functional group on the surface and the other end of which can be 
further derivatized with groups that function as an adsorbent. In another example, a 

30 porous silicon surface generated from crystalline silicon can be chemically modified to 
include adsorbents for binding markers. In yet another example, adsorbents with a 
hydrogel backbone can be formed directly on the substrate surface by in situ 
polymerizing a monomer solution which comprises, e.g., substituted acrylamide 
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monomers, substituted acrylate monomers, or derivatives thereof comprising a functional 
group of choice as an adsorbent. 

The probe substrate comprising an adsorbent contacts a sample. The 
sample is preferably a biological fluid sample. Examples of biological fluid samples 
5 include blood, serum, urine, prostatic fluid, seminal fluid, semen, seminal plasma and 
prostate tissue (e.g., epithelial tissue, including extracts thereof). 

The sample can be solubilized in or admixed with an eluant. The probe 
substrate comprising an adsorbent then contacts the solution using any techniques 
including bathing, soaking, dipping, spraying, washing over, or pipetting, etc. Generally, 
1 0 a volume of sample containing from a few attomoles to 1 00 picomoles of marker in about 
1 fil to 500 fA is sufficient for binding to the adsorbent. 

The sample can contact the probe substrate comprising an adsorbent for a 
period of time sufficient to allow the marker to bind to the adsorbent. Typically, the 
sample and the substrate comprising the adsorbent are contacted for a period of between 
15 about 30 seconds and about 12 hours, and preferably, between about 30 seconds and 
about 15 minutes. 

The temperature at which the sample contacts the probe substrate 
comprising an adsorbent can be a function of the particular sample and the selected probe. 
Typically, the sample is contacted to the probe substrate under ambient temperature and 
20 pressure conditions. For some samples, however, modified temperature (typically 4°C 
through 37°C), and pressure conditions can be desirable, which conditions are 
determinable by those skilled in the art. 

After the probe substrate comprising an adsorbent contacts the sample or 
sample solution, it is preferred that unbound materials on the probe substrate surface are 
25 washed out so that only the bound materials remain on the substrate surface. Washing a 
probe substrate surface can be accomplished by, e.g., bathing, soaking, dipping, rinsing, 
spraying, or washing the substrate surface with an eluant or a washing solution. A 
microfluidics process is preferably used when a washing solution such as an eluant is 
introduced to small spots of adsorbents on the probe. Typically, the washing solution can 
30 be at a temperature of between 0°C and 100°C, preferably between 4°C and 37°C. 

Any suitable washing solutions or eluants can be used to wash the probe 
substrate surface. For example, organic solutions or aqueous solutions can be used. 
Preferably, an aqueous solution is used. Exemplary aqueous solutions include a HEPES 
buffer, a Tris buffer, a phosphate buffered saline, etc. The selection of a particular 
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washing solution or an eluant is dependent on other experimental conditions (e.g., types 
of adsorbents used or markers to be detected), and can be determined by those of skill in 
the art. For example, if a probe comprising a hydrophobic group and a sulfonate group as 
adsorbents (e.g., SAX1 ProteinChip® array) is used, then an aqueous solution, such as a 
5 HEPES buffer, may be preferred. In another example, if a probe comprising a metal 
binding group as an adsorbent (e.g. , Cu(II) ProteinChip® array) is used, then an aqueous 
solution, such as a phosphate buffered saline, may be preferred. In yet another example, 
if a probe comprising a hydrophobic group (e.g., H4 ProteinChip® array) is used, then 
water may be preferred as a washing solution. 

1 0 An energy absorbing molecule (e.g., in solution) can be applied to markers 

or other substances bound on the probe substrate surface. Spraying, pipetting, or dipping 
can be used. This can be done after unbound materials are washed off of the probe 
substrate surface. An energy absorbing molecule refers to a molecule that absorbs energy 
from an energy source in a gas phase ion spectrometer, thereby assisting desorption of 

1 5 markers or other substances from a probe surface. Exemplary energy absorbing 

molecules include cinnamic acid derivatives, sinapinic acid and dihydroxybenzoic acid. 

After the marker is bound to the probe, it is detected using gas phase ion 
spectrometry. Markers or other substances bound to the adsorbents on the probes can be 
analyzed using a gas phase ion spectrometer. The quantity and characteristics of the 

20 marker can be determined using gas phase ion spectrometry. Other substances in addition 
to the marker of interest can also be detected by gas phase ion spectrometry. 

In one embodiment, a mass spectrometer can be used to detect markers on 
the probe. In a typical mass spectrometer, a probe with a marker is introduced into an 
inlet system of the mass spectrometer. The analyte is then desorbed by a desorption 

25 source such as a laser, fast atom bombardment, or high energy plasma. The generated 
desorbed, volatilized species consist of preformed ions or neutrals which are ionized as a 
direct consequence of the desorption event. Generated ions are collected by an ion optic 
assembly, and then a mass analyzer disperses and analyzes the passing ions. The ions 
exiting the mass analyzer are detected by a detector. The detector then translates 

30 information of the detected ions into mass-to-charge ratios. Detection of the presence of 
a marker or other substances will typically involve detection of signal intensity. This, in 
turn, can reflect the quantity and character of a marker bound to the probe. 

In a preferred embodiment, a laser desorption time-of-flight mass 
spectrometer is used with the probe of the present invention. In laser desorption mass 
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spectrometry, a probe with a bound analyte is introduced into an inlet system. The 
analyte is desorbed and ionized into the gas phase by laser from the ionization source. 
The ions generated are collected by an ion optic assembly, and then in a time-of-flight 
mass analyzer, ions are accelerated through a short high voltage field and let drift into a 
5 high vacuum chamber. At the far end of the high vacuum chamber, the accelerated ions 
strike a sensitive detector surface at a different time. Since the time-of-flight is a function 
of the mass of the ions, the elapsed time between ion formation and ion detector impact 
can be used to identify the presence or absence of molecules of specific mass to charge 
ratio. As any person skilled in the art understands, any of these components of the laser 

1 0 desorption time-of-flight mass spectrometer can be combined with other components 
described herein in the assembly of mass spectrometer that employs various means of 
desorption, acceleration, detection, measurement of time, etc. 

In another embodiment, an ion mobility spectrometer can be used to detect 
and characterize a marker. The principle of ion mobility spectrometry is based on 

1 5 different mobility of ions. Specifically, ions of a sample produced by ionization move at 
different rates, due to their difference in, e.g., mass, charge, or shape, through a tube 
under the influence of an electric field. The ions (typically in the form of a current) are 
registered at the detector which can then be used to identify a marker or other substances 
in the sample. One advantage of ion mobility spectrometry is that it can operate at 

20 atmospheric pressure. 

In yet another embodiment, a total ion current measuring device can be 
used to detect and characterize markers. This device can be used when the probe has a 
surface chemistry that allows only a single type of marker to be bound. When a single 
type of marker is bound on the probe, the total current generated from the ionized marker 

25 reflects the nature of the marker. The total ion current produced by the marker can then 
be compared to stored total ion current of known compounds. Characteristics of the 
marker can then be determined. 

Data generated by desorption and detection of markers can be analyzed 
with the use of a programmable digital computer. The computer program generally 

30 contains a readable medium that stores codes. Certain code can be devoted to memory 
that includes the location of each feature on a probe, the identity of the adsorbent at that 
feature and the elution conditions used to wash the adsorbent. Using this information, the 
program can then identify the set of features on the probe defining certain selectivity 
characteristics (e.g., types of adsorbent and eluants used). The computer also contains 
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code that receives as input, data on the strength of the signal at various molecular masses 
received from a particular addressable location on the probe. This data can indicate the 
number of markers detected, optionally including the strength of the signal and the 
determined molecular mass for each marker detected. 
5 Data analysis can include the steps of determining signal strength (e.g., 

height of peaks) of a marker detected and removing "outliers" (data deviating from a 
predetermined statistical distribution). For example, the observed peaks can be 
normalized, a process whereby the height of each peak relative to some reference is 
calculated. For example, a reference can be background noise generated by instrument 

10 and chemicals (e.g., energy absorbing molecule) which is set as zero in the scale. Then 
the signal strength detected for each marker or other substances can be displayed in the 
form of relative intensities in the scale desired (e.g., 100). Alternatively, a standard may 
be admitted with the sample so that a peak from the standard can be used as a reference to 
calculate relative intensities of the signals observed for each marker or other markers 

15 detected. 

The computer can transform the resulting data into various formats for 
displaying. In one format, referred to as "spectrum view or retentate map," a standard 
spectral view can be displayed, wherein the view depicts the quantity of marker reaching 
the detector at each particular molecular weight. In another format, referred to as "peak 

20 map," only the peak height and mass information are retained from the spectrum view, 
yielding a cleaner image and enabling markers with nearly identical molecular weights to 
be more easily seen. In yet another format, referred to as "gel view," each mass from the 
peak view can be converted into a grayscale image based on the height of each peak, 
resulting in an appearance similar to bands on electrophoretic gels. In yet another format, 

25 referred to as "3-D overlays," several spectra can be overlaid to study subtle changes in 
relative peak heights. In yet another format, referred to as "difference map view," two or 
more spectra can be compared, conveniently highlighting unique markers and markers 
which are up-; or down-regulated between samples. Marker profiles (spectra) from any 
two samples may be compared visually. 
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III. METHODS OF DETECTING MARKERS 

A. Characterization Of The Markers 

A set of markers was identified that is useful in distinguishing prostate 
cancer, prostate intraepithelial neoplasia, benign prostate hyperplasia and a negative 
5 diagnosis. The set was identified in cell lysate from prostate epithelial tissue. 

1. Cell Lysate Marker Set 

A set of markers useful for diagnosing prostate cancer, prostate 
intraepithelial neoplasia, benign prostate hyperplasia and a negative diagnosis has the 
following molecular weights as determined by mass spectrometry: Marker EP1 : 3448 ± 

10 1 9 Da, Marker EP2: 4036 ± 22 Da, Marker EP3: 4361 ± 24 Da, Marker EP4: 441 3 ± 24 
Da, Marker EP5: 4639 ± 26 Da, Marker EP6: 4749 ± 26 Da, Marker EP7: 4827± 27 Da, 
Marker EP8: 5666 ± 3 1 Da, Marker EP9: 8445 ± 46 Da, Marker EP10: 1 1744 ± 65 Da, 
Marker EP11: 14696± 81 Da, Marker EP1 2: 24184 ± 133 Da, Marker EP 13: 48308 ± 
266 Da, and Marker EP14: 53830 ± 296 Da. 

15 These markers are further characterized as follows: Cell lysate from 

prostate epithelial cells isolated by laser capture microdissection was fractionated and 
analyzed by retentate chromatography. In this method, test samples were applied to a 
substrate comprising a copper chelate adsorbent. The adsorbents were washed with a first 
eluent of 20 mM HEPES, pH 8.0, 0.1% TritonX 100, to allow binding of proteins to the 

20 chip, and with a second eluent of phosphate buffered saline (PBS) to remove non-bound 
materials. Markers retained on the adsorbents were detected by SELDI mass 
spectrometry. The fourteen markers could be resolved by the metal chelate adsorbent. 

B. Sample Sources For Markers 

These markers can be detected in many different samples derived from a 
25 patient or subject. The sample is preferably a biological fluid sample. Examples of 
biological fluid samples useful in this invention include blood, serum, urine, prostatic 
fluid, seminal fluid, semen, seminal plasma and prostate tissue {e.g., epithelial tissue, 
including extracts thereof). 

Because the markers in the Marker Set were discovered in cell lysate from 
30 prostate epithelium, this tissue is a preferred sample source for the methods of this 

invention. More particularly, the samples were collected by laser capture microdissection 
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of this tissue. Therefore, this is a preferred method of obtaining sample for the methods 
of this invention. 



C. Detection of Markers 

After a sample is obtained, any suitable method can be used to detect the 
5 marker in a sample from a subject being tested. For example, gas phase ion spectrometry 
or an immunoassay can be used. 

1. Mass Spectrometry 

In a preferred embodiment, the markers of this invention are detected 
using mass spectrometry, more preferably using gas phase ion spectrometry and, still 

1 0 more preferably, using surface-enhanced laser desorption/ionization mass spectrometry 
("SELDI"). SELDI is an improved method of gas phase ion spectrometry for 
biomolecules. In SELDI, the surface on which the analyte is applied plays an active role 
in the analyte capture, desorption and/or desorption. 

One popular method of gas phase ion spectrometry for biomolecules is 

1 5 MALDI (matrix-assisted laser desorption/ionization) mass spectrometry. In MALDI, the 
analyte is typically mixed with a matrix material that, upon drying, forms crystals that 
capture the analyte. The matrix material absorbs energy from the energy source which 
otherwise would fragment the bimolecular analytes. 

20 a) Preparation of sample 

(i) Pre-fractionation 

In one embodiment, the sample can be pre-fractionated before being 

subjected to gas phase ion spectrometry. Pre-fractionation has the advantage of providing 

a less complex sample for analysis. On the other hand, it introduces an extra step in the 
25 analytic process that could be unattractive in, for example, a clinical setting. Samples can 

be pre-fractionated by any means known in the art, including, without limitation, size 

fractionation and chromatographic fractionation. 

In one embodiment, seminal plasma samples can be pre-fractionated 

before analysis by gas-phase ion spectrometry. A preferred method of fraction includes a 
30 first fractionation by gel exclusion chromatography. Sizing columns which exclude 

molecules whose molecular mass is greater than 30 kDa are particularly useful for this. 

Fractions of various sizes then can be examined directly or subjected to a second 
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fractionation step based on anion exchange chromatography. Using an anion exchange Q 
spin column, markers can be eluted using a low strength buffer (e.g., about 10 mM to 50 
mM Tris, HEPES or PBS) with salt at low to medium concentration (e.g., about 0.1 M to 
0.6 M) and a non-ionic detergent at low concentration (e.g., TritonX 100 at about 0.05 to 
5 0.2%). A particularly useful buffer is 20 mM Tris, 0.5 M NaCl and 0.1% TritonX 100. 
In another embodiment, the markers are eluted using a pH gradient. 

(ii) Retentate Chromatography 
In another embodiment, the sample is fractionated on a bio- 
chromatographic chip by retentate chromatography before gas phase ion spectrometry. A 

10 preferred chip is the Protein Chip® array available from Ciphergen Biosystems, Inc. 
(Palo Alto, CA). As described above, the chip or probe is adapted for use in a mass 
spectrometer. The chip comprises an adsorbent attached to its surface. This adsorbent 
can function, in certain applications, as an in situ chromatography resin. In basic 
operation, the sample is applied to the adsorbent in an eluent solution. Molecules for 

1 5 which the adsorbent has affinity under the wash condition bind to the adsorbent. 

Molecules that do not bind to the adsorbent are removed with the wash. The adsorbent 
can be further washed under various levels of stringency so that analytes are retained or 
eluted to an appropriate level for analysis. Then, an energy absorbing molecule can be 
added to the adsorbent spot to further facilitate desorption and ionization. The analyte is 

20 detected by desorption from the adsorbent, ionization and direct detection by a detector. 
Thus, retentate chromatography differs from traditional chromatography in that the 
analyte retained by the affinity material is detected, whereas in traditional 
chromatography, material that is eluted from the affinity material is detected. 

A useful adsorbent for the markers of the Marker Set is a metal chelate 

25 adsorbent and, in particular, copper. This surface also is usefully washed with a pH 

neutral buffer such as PBS. A preferred surface is MAC3 from Ciphergen Biosystems, 
Inc. IMAC3 comprises a copper chelate adsorbent. 

Another useful adsorbent for any of the markers of this invention is an 
antibody that specifically binds the marker. Chips comprising antibodies that bind to one 

30 or more markers are particularly useful for removing non-markers which do not bind to 
the antibodies and which function as "noise" in the detection process. 

As will be evident to anyone skilled in the art, different markers may be 
more easily resolved using different combinations of adsorbents and eluants. 
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(iii) Mixing the sample with an energy absorbing 
matrix 

In MALDI applications the sample to be analyzed is mixed with an energy 
absorbing matrix prior to ionization and mass analysis. The sample/matrix mixture is 
5 then applied to the surface of an inert mass spectrometer probe. Suitable matrix materials 
are well know to those of skill in the art and include 3-hydroxypicolinic acid (3-hydroxy- 
2-pyridinecarboxylic acid), nicotinic acid, N-oxide, 2*-6'-dihydroxyacetophenone, gentisic 
acid (2,5-dihydroxybenzoic acid), a-cyano-4-hydroxycinnamic acid, ferulic acid (4- 
hydroxy-3-methoxycinnamic acid) and sinapinic acid (3,5-dimethoxy-4-hydroxycinnamic 
10 acid). 

The resolving power of MALDI is limited by the complexity of the sample 
being analyzed. Therefore prefractionation or otherwise purifying the sample prior to 
MALDI analysis is preferred. 

(iv) Modification of Marker Before Analysis 

15 In another embodiment, the markers are modified before detection in order 

to alter their molecular weight. These methods may decrease ambiguity of detection. For 
example, the markers may be subject to proteolytic digestion before analysis. Any 
protease can be used. Proteases such as trypsin, that are likely to cleave the markers into 
a discrete number of fragments are particularly useful. The fragments that result from 

20 digestion function as a fingerprint for the markers, thereby enabling their detection 
indirectly. This is particularly useful where there are markers with similar molecular 
masses that might be confused for the marker in question. Also, proteolytic 
fragmentation is useful for high molecular weight markers because smaller markers are 
more easily resolved by mass spectrometry. 

25 In another embodiment, the markers can be modified by the attachment of 

a tag of particular molecular weight that bind specifically to molecular markers, further 
distinguishing them. 

b) Performance of laser desorption/ionization mass 
spectrometry 

30 After the marker is detected by mass spectrometry, preferably gas phase 

ion spectrometry, a test amount of marker can be determined. For example, a signal is 
displayed at the molecular weight of the marker of interest. Based on the strength or 
magnitude of the displayed signal, the amount of marker in a sample being tested can be 
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determined. It is noted that the test amount of marker in a sample need not be measured 
in absolute units, but can be in relative units as long as it can be compared qualitatively or 
quantitatively to a control amount of a marker. For example, the amount of the marker 
detected can be displayed in terms of relative intensity based on the background noise. 
5 Preferably, the test amount and the control amount of markers are measured under the 
same conditions. 

If desired, the absolute amount of a marker can be determined by 
calibration. For example, a purified, known marker can be added in increasing amounts 
to different spots of adsorbents on the probe surface. Then peaks from each spot can be 
10 obtained and plotted in a graph against the concentration of known marker protein at each 
spot. From the peak intensity vs. concentration plot, the absolute amount of a marker in 
any sample being tested can be determined. 

2. Immunoassay Detection 

In another embodiment of the detection method, an immunoassay can be 

15 used to qualitatively or quantitatively detect and analyze markers in a sample. This 
method comprises: (a) providing an antibody that specifically binds to a marker; (b) 
contacting a sample with the antibody; and (c) detecting the presence of a complex of the 
antibody bound to the marker in the sample. 

To prepare an antibody that specifically binds to a marker, purified 

20 markers or their nucleic acid sequences can be used. Nucleic acid and amino acid 

sequences for markers can be obtained by further characterization of these markers. For 
example, each marker can be peptide mapped with a number of enzymes (e.g., trypsin, 
V8 protease, etc.). The molecular weights of digestion fragments from each marker can 
be used to search the databases, such as SwissProt database, for sequences that will match 

25 the molecular weights of digestion fragments generated by various enzymes. Using this 
method, the nucleic acid and amino acid sequences of other markers can be identified if 
these markers are known proteins in the databases. 

Alternatively, the proteins can be sequenced using protein ladder 
sequencing. Protein ladders can be generated by, for example, fragmenting the molecules 

30 and subjecting fragments to enzymatic digestion or other methods that sequentially 

remove a single amino acid from the end of the fragment. Methods of preparing protein 
ladders are described, for example, in International Publication WO 93/24834 (Chait et 
al.) and United States Patent 5,792,664 (Chait et al.). The ladder is then analyzed by 
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mass spectrometry. The difference in the masses of the ladder fragments identify the 
amino acid removed from the end of the molecule. 

If the markers are not known proteins in the databases, nucleic acid and 
amino acid sequences can be determined with knowledge of even a portion of the amino 
5 acid sequence of the marker. For example, degenerate probes can be made based on the 
N-terminal amino acid sequence of the marker. These probes can then be used to screen a 
genomic or cDNA library created from a sample from which a marker was initially 
detected. The positive clones can be identified, amplified, and their recombinant DNA 
sequences can be subcloned using techniques which are well known. See, e.g., Current 

1 0 Protocols for Molecular Biology (Ausubel et al. , Green Publishing Assoc. and Wiley- 
Interscience 1989) and Molecular Cloning: A Laboratory Manual, 2nd Ed. (Sambrook et 
al, Cold Spring Harbor Laboratory, NY 1989). 

Using the purified markers or their nucleic acid sequences, antibodies that 
specifically bind to a marker can be prepared using any suitable methods known in the 

15 art. See, e.g., Coligan, Current Protocols in Immunology (1991); Harlow & Lane, 

Antibodies: A Laboratory Manual (1988); Goding, Monoclonal Antibodies: Principles 
and Practice (2d ed. 1986); and Kohler & Milstein, Nature 256:495-497 (1975). Such 
techniques include, but are not limited to, antibody preparation by selection of antibodies 
from libraries of recombinant antibodies in phage or similar vectors, as well as 

20 preparation of polyclonal and monoclonal antibodies by immunizing rabbits or mice {see, 
e.g., Huse et al, Science 246:1275-1281 (1989); Ward et al, Nature 341 :544-546 
(1989)). 

After the antibody is provided, a marker can be detected and/or quantified 
using any of a number of well recognized immunological binding assays {see, e.g., U.S. 

25 Patents 4,366,241; 4,376,1 10; 4,517,288; and 4,837,168). Useful assays include, for 
example, an enzyme immune assay (EIA) such as enzyme-linked immunosorbent assay 
(ELISA), a radioimmune assay (RIA), a Western blot assay, or a slot blot assay. For a 
review of the general immunoassays, see also, Methods in Cell Biology: Antibodies in 
Cell Biology, volume 37 (Asai, ed. 1993); Basic and Clinical Immunology (Stites & Terr, 

30 eds., 7th ed. 1991). 

Generally, a sample obtained from a subject can be contacted with the 
antibody that specifically binds the marker. Optionally, the antibody can be fixed to a 
solid support to facilitate washing and subsequent isolation of the complex, prior to 
contacting the antibody with a sample. Examples of solid supports include glass or 
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plastic in the form of, e.g., a microtiter plate, a stick, a bead, or a microbead. Antibodies 
can also be attached to a probe substrate or ProteinChip® array described above. (See 
e.g. Xiao et al, Cancer Research 62: 6029-6033 (2001)) The sample is preferably a 
biological fluid sample taken from a subject. Examples of biological fluid samples 
5 include blood, serum, urine, prostatic fluid, seminal fluid, semen, seminal plasma and 
prostate tissue (e.g., epithelial tissue, including extracts thereof). In a preferred 
embodiment, the biological fluid comprises seminal plasma. The sample can be diluted 
with a suitable eluant before contacting the sample to the antibody. 

After incubating the sample with antibodies, the mixture is washed and the 

1 0 antibody-marker complex formed can be detected. This can be accomplished by 

incubating the washed mixture with a detection reagent. This detection reagent may be, 
e.g., a second antibody which is labeled with a detectable label. Exemplary detectable 
labels include magnetic beads (e.g., DYNABEADS™), fluorescent dyes, radiolabels, 
enzymes (e.g., horse radish peroxide, alkaline phosphatase and others commonly used in 

15 an ELIS A), and colorimetric labels such as colloidal gold or colored glass or plastic 
beads. Alternatively, the marker in the sample can be detected using an indirect assay, 
wherein, for example, a second, labeled antibody is used to detect bound marker-specific 
antibody, and/or in a competition or inhibition assay wherein, for example, a monoclonal 
antibody which binds to a distinct epitope of the marker are incubated simultaneously 

20 with the mixture. 

Throughout the assays, incubation and/or washing steps may be required 
after each combination of reagents. Incubation steps can vary from about 5 seconds to 
several hours, preferably from about 5 minutes to about 24 hours. However, the 
incubation time will depend upon the assay format, marker, volume of solution, 

25 concentrations and the like. Usually the assays will be carried out at ambient 

temperature, although they can be conducted over a range of temperatures, such as 10°C 
to 40°C. 

The immunoassay techniques are well-known in the art, and a general 
overview of the applicable technology can be found in Harlow & Lane, supra. 
30 The immunoassay can be used to determine a test amount of a marker in a 

sample from a subject. First, a test amount of a marker in a sample can be detected using 
the immunoassay methods described above. If a marker is present in the sample, it will 
form an antibody-marker complex with an antibody that specifically binds the marker 
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under suitable incubation conditions described above. The amount of an antibody-marker 
complex can be determined by comparing to a standard. As noted above, the test amount 
of marker need not be measured in absolute units, as long as the unit of measurement can 
be compared to a control amount. 

5 IV. METHODS OF DIAGNOSIS USING MARKERS 

The markers in the Marker Set may be used alone, in combination with 
other markers in the set, or with other markers entirely (e.g., PSA or prostate specific 
membrane antigen) to aid in the diagnosis of prostate cancer, prostate intraepithelial 
neoplasia, benign prostate hyperplasia or a negative diagnosis. Used together, they tend 

1 0 to provide more information for the diagnostician, increasing the percentage of true 
positive and true negative diagnoses and decreasing the percentage of false positive or 
false negative diagnoses, than a single marker alone. 

In certain embodiments, it is useful to detect the presence or absence of a 
marker, without quantifying the amount of marker. This information is useful because 

1 5 the markers show differences in frequency of detection between PCA, PIN, BPH and 
negative diagnosis. In other embodiments, detecting the presence or absence of the 
markers can also involve quantifying the markers. This information is useful for several 
purposes. First, individual markers may be slightly detectable in certain states based on a 
weak peak, but easily detectable in other states based on a strong peak. Second, certain of 

20 the markers are frequently detected in all states, but their relative ratio differs 
dramatically among PCA PIN, BPH or normal samples. 

It is, of course, understood by diagnosticians that diagnostic tests are 
measured by their degree of specificity and sensitivity. Most diagnostic tests suffer from 
some imperfection in their ability to provide a positive result for every true positive 

25 diagnosis, as well as some imperfection in their ability to provide a negative result for 

every true negative diagnosis. However, tests which are not perfectly specific or sensitive 
are useful in diagnosis because they render certain diagnoses more likely than not. For 
example, markers EP5 and EP12 are detected in 43% of all samples positive for PCA and 
0% of negative samples. Thus, detection of EP5 or EP12 is useful in that it provides a 

30 probable diagnosis of PCA. Failure to detect EP5 or EP 1 2 also is useful in that it 

provides a probable negative diagnosis. Furthermore, used in combination with other 
markers, the diagnostician can increase their confidence in a PCA or negative diagnosis. 
With this in mind, the markers have frequency distributions as follows. 
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A. Cell Lysate Marker Set 

Table 1 is a comparison of the relative abundance levels of markers EP1 to 
EP14 in BPH, PIN, and PCA cell lysates. The relative quantity can be determined in gas 
phase ion spectrometry either by peak height or peak area. Preferably, this determination 
5 is normalized with respect to an internal standard, which is present in all samples. 
Markers EP2, EP3, EP5, and EP6 occur in a higher abundance in both PIN and PCA 
samples. Therefore, detection of two or three of these markers is positively correlated 
with a pathologic diagnosis. EP8 occurs in higher abundance in BPH samples. 
Therefore, detection of this marker is positively correlated with a normal or benign 
10 diagnosis. 

Table 2 displays the percent of BPH, PIN, or PCA prostate cell lysates 
found to be over- or under-expressing markers EP1-EP14. Marker EP 1 appears to be 
expressed in all cell types but is present in increasing amounts in BPH, PIN, and PCA cell 
lysates. EP10 appears to be under-expressed in BPH and PIN cell lysates and a majority 
15 of the PCA cell lysates. EP13 appears to be under-expressed in a majority of the PIN and 
PCA cell lysates. 

Table 3 shows the statistical analysis of the differential expression of 
selected proteins as determined by the Wilcoxon signed ranks test. The differential 
expression of EP2, EP3, and EP6 was significant in differentiating PIN and/or PCA from 
20 non-cancer cells; while the differential expression of EP8 was significant in 
distinguishing BPH from PIN and PCA. 

Table 4 displays the sensitivity and specificity of single markers and 
combination of markers found in PIN and PCA cell lysates. Combining markers EP2 and 
EP5, EP3 and EP5, or EP3 and EP6 increased the sensitivity for identifying PIN or PCA 
25 samples. Therefore, detection of these combinations of markers is positively correlated 
with a pathological diagnosis. 

Table 5 shows that by combining the expression levels of seven 
biomarkers, EP2, EP3, EP4, EP5, EP6, EP8, and PSA, a specificity of 93.3% and 
sensitivity of 93.8%, with an overall accuracy of 93.5% was achieved in differentiating 
30 normal (N/BPH) from cancer (PIN/PCA). 

B. Detection of Markers: Comparison with Controls 

Detection of markers can usefully involve comparing the test data to a 
control. The control can be useful for calibration of "normal" amounts and for 
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quantifying the amount of a marker against a normal or control amount. For example, it 
can be useful to know whether a marker present in both normal and pathologic conditions 
is increased by 1.5 fold, by 2 fold, by 5 fold, or by 10 fold compared to the control 
amount. 

5 In one embodiment, the control amount can be an amount of a marker 

present in comparable samples from normal patients. The control amount is measured 
under the same or substantially similar experimental conditions as in measuring the test 
amount. For example, if a test sample is obtained from a subject's seminal plasma and a 
marker is detected using a particular probe, then a control amount of the marker is 
10 preferably determined from a seminal plasma sample of a patient using the same probe. It 
is preferred that the control amount of marker is determined based upon a significant 
number of samples from subjects who do not have prostate cancer (e.g., BPH or negative 
diagnosis patients) so that it reflects variations of the marker amounts in that population. 

C. Correlation By Computer 

1 5 Data generated by mass spectrometry is usefully analyzed by computer 

software. The software can comprise code that converts signal from the mass 
spectrometer into computer readable form. The software also can include code that 
applies an algorithm to the analysis of the signal to determine whether the signal 
represents a "peak" in the signal corresponding to a marker of this invention, or other 

20 useful markers. The software also can include code that executes an algorithm that 

compares signal from a test sample to a typical signal characteristic of "normal," "BPH" 
and "PCA" and determines the closeness of fit between the two signals. The software 
also can include code indicating which the test sample is closest to, thereby providing a 
probable diagnosis. The software can also include code that executes an algorithm which 

25 comprises an artificial intelligence program. Examples of artificial intelligence programs 
include fuzzy logic, cluster analysis or neural networks. Executing the artificial 
intelligence program can include code indicating that the test sample correlates most 
closely with a normal, BPH, PIN or PCA sample, thereby providing a probable diagnosis. 

Data generation in mass spectrometry begins with the detection of ions by 

30 an ion detector. Ions that strike the detector generate an electric potential that is digitized 
by a high speed time-array recording device that digitally captures the analog signal. 
Ciphergen's ProteinChip® system employs an analog-to-digital converter (ADC) to 
accomplish this. The ADC integrates detector output at regularly spaced time intervals 
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into time-dependent bins. The time intervals typically are one to four nanoseconds long. 
Furthermore, the time-of-flight spectrum ultimately analyzed typically does not represent 
the signal from a single pulse of ionizing energy against a sample, but rather the sum of 
signals from a number of pulses. This reduces noise and increases dynamic range. This 
5 time-of-flight data is then subject to data processing. In Ciphergen's ProteinChip® 
software, data processing typically includes TOF-to-M/Z transformation, baseline 
subtraction, high frequency noise filtering. 

TOF-to-M/Z transformation involves the application of an algorithm that 
transforms times-of- flight into mass-to-charge ratio (M/Z). In this step, the signals are 

10 converted from the time domain to the mass domain. That is, each time-of-flight is 
converted into mass-to-charge ratio, or M/Z. Calibration can be done internally or 
externally. In internal calibration, the sample analyzed contains one or more analytes of 
known M/Z. Signal peaks at times-of-flight representing these massed analytes are 
assigned the known M/Z. Based on these assigned M/Z ratios, parameters are calculated 

15 for a mathematical function that converts times-of-flight to M/Z. In external calibration, 
a function that converts times-of-flight to M/Z, such as one created by prior internal 
calibration, is applied to a time-of-flight spectrum without the use of internal calibrants. 
Baseline subtraction improves data quantification by eliminating artificial, reproducible 
instrument offsets that perturb the spectrum. It involves calculating a spectrum baseline 

20 using an algorithm that incorporates parameters such as peak width, and then subtracting 
the baseline from the mass spectrum. 

High frequency noise signals are eliminated by the application of a 
smoothing function. A typical smoothing function applies a moving average function to 
each time-dependent bin. In an improved version, the moving average filter is a variable 

25 width digital filter in which the bandwidth of the filter varies as a function of, e.g., peak 
bandwidth, generally becoming broader with increased time-of-flight. See, e.g., WO 
00/70648, November 23, 2000 (Gavin et al., "Variable Width Digital Filter for Time-of- 
flight Mass Spectrometry")- 

Analysis generally involves the identification of peaks in the spectrum that 

30 represent signal from an analyte. Peak selection can, of course, be done by eye. 

However, software is available as part of Ciphergen's ProteinChip® software that can 
automate the detection of peaks. In general, this software functions by identifying signals 
having a signal-to-noise ratio above a selected threshold and labeling the mass of the peak 
at the centroid of the peak signal. In one useful application many spectra are compared to 
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identify identical peaks present in some selected percentage of the mass spectra. One 
version of this software clusters all peaks appearing in the various spectra within a 
defined mass range, and assigns a mass (M/Z) to all the peaks that are near the mid-point 
of the mass (M/Z) cluster. 
5 Peak data from one or more spectra can be subject to further analysis by, 

for example, creating a spreadsheet in which each row represents a particular mass 
spectrum, each column represents a peak in the spectra defined by mass, and each cell 
includes the intensity of the peak in that particular spectrum. Various statistical or pattern 
recognition approaches can applied to the data. 

10 The spectra that are generated in embodiments of the invention can be 

classified using a partem recognition process that uses a classification model. In general, 
the spectra will represent samples from at least two different groups for which a 
classification algorithm is sought. For example, the groups can be pathological v. non- 
pathological (e.g., cancer v. non-cancer), drug responder v. drug non-responder, toxic 

1 5 response v. non-toxic response, progressor to disease state v. non-progressor to disease 
state, phenotypic condition present v. phenotypic condition absent. 

In some embodiments, data derived from the spectra (e.g., mass spectra or 
time-of-fiight spectra) that are generated using samples such as "known samples" can 
then be used to "train" a classification model. A "known sample" is a sample that is pre- 

20 classified. The data that are derived from the spectra and are used to form the 
classification model can be referred to as a "training data set". Once trained, the 
classification model can recognize patterns in data derived from spectra generated using 
unknown samples. The classification model can then be used to classify the unknown 
samples into classes. This can be useful, for example, in predicting whether or not a 

25 particular biological sample is associated with a certain biological condition (e.g., 
diseased vs. non diseased). 

The training data set that is used to form the classification model may 
comprise raw data or pre-processed data. In some embodiments, raw data can be 
obtained directly from time-of-flight spectra or mass spectra, and then may be optionally 

30 "pre-processed" as described above. 

Classification models can be formed using any suitable statistical 
classification (or "learning") method that attempts to segregate bodies of data into classes 
based on objective parameters present in the data. Classification methods may be either 
supervised or unsupervised. Examples of supervised and unsupervised classification 
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processes are described in Jain, "Statistical Pattern Recognition: A Review", IEEE 
Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. I, January 2000, 
which is herein incorporated by reference in its entirety. 

In supervised classification, training data containing examples of known 
5 categories are presented to a learning mechanism, which learns one more sets of 

relationships that define each of the known classes. New data may then be applied to the 
learning mechanism, which then classifies the new data using the learned relationships. 
Examples of supervised classification processes include linear regression processes (e.g., 
multiple linear regression (MLR), partial least squares (PLS) regression and principal 
10 components regression (PCR)), binary decision trees (e.g., recursive partitioning 

processes such as CART - classification and regression trees), artificial neural networks 
such as backpropagation networks, discriminant analyses (e.g., Bayesian classifier or 
Fischer analysis), logistic classifiers, and support vector classifiers (support vector 
machines). 

1 5 A preferred supervised classification method is a recursive partitioning 

process. Recursive partitioning processes use recursive partitioning trees to classify 
spectra derived from unknown samples. Further details about recursive partitioning 
processes are in U.S. Provisional Patent Application Nos. 60/249,835, filed on November 
16, 2000, and 60/254,746, filed on December 1 1, 2000, and U.S. Non-Provisional Patent 

20 Application Nos. 09/999,081, filed November 15, 2001, and 10/084,587, filed on 
February 25, 2002. All of these U.S. Provisional and Non Provisional Patent 
Applications are herein incorporated by reference in their entirety for all purposes. 

In other embodiments, the classification models that are created can be 
formed using unsupervised learning methods. Unsupervised classification attempts to 

25 learn classifications based on similarities in the training data set, without pre classifying 
the spectra from which the training data set was derived. Unsupervised learning methods 
include cluster analyses. A cluster analysis attempts to divide the data into "clusters" or 
groups that ideally should have members that are very similar to each other, and very 
dissimilar to members of other clusters. Similarity is then measured using some distance 

30 metric, which measures the distance between data items, and clusters together data items 
that are closer to each other. Clustering techniques include the MacQueen's K-means 
algorithm and the Kohonen's Self-Organizing Map algorithm. 

The classification models can be formed on and used on any suitable 
digital computer. Suitable digital computers include micro, mini, or large computers 
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using any standard or specialized operating system such as a Unix, Windows™ or 
Linux™ based operating system. The digital computer that is used may be physically 
separate from the mass spectrometer that is used to create the spectra of interest, or it may 
be coupled to the mass spectrometer. 
5 The training data set and the classification models according to 

embodiments of the invention can be embodied by computer code that is executed or used 
by a digital computer. The computer code can be stored on any suitable computer 
readable media including optical or magnetic disks, sticks, tapes, etc., and can be written 
in any suitable computer programming language including C, C++, visual basic, etc. 

10 Using pattern recognition techniques discussed above, classification 

algorithms may be developed from the classification models. These are highly useful in 
aiding the diagnosis of prostate cancer, prostate intraepithelial neoplasia , or benign 
prostate hyperplasia versus a negative diagnosis, using the markers of the present 
invention. By screening samples from sources known to have expressed each condition, 

1 5 the training data set, the pattern recognition techniques described herein identify 

difference patterns in the marker composition of diseased tissue relative to normal tissue. 
These difference patterns may comprise differences in the presence and/or amount of a 
particular marker or markers. Difference patterns may be built from a statistical analysis 
of several test data sets using methods well known to those of skill in the art. The 

20 difference patterns generated may then be used to construct classification models, as 
described above, and used in comparison studies to diagnose disease states of patients. 



V. KITS 

In yet another aspect, the invention provides kits for aiding a diagnosis of 
25 prostate cancer, wherein the kits can be used to detect the markers of the present 

invention. For example, the kits can be used to detect any one or combination of markers 
described above, which markers are differentially present in samples of a prostate cancer 
patient, BPH and normal patients. The kits of the invention have many applications. For 
example, the kits can be used to differentiate if a subject has prostate cancer, BPH or has 
30 a negative diagnosis, thus aiding a prostate cancer diagnosis. In another example, the kits 
can be used to identify compounds that modulate expression of the markers in in vitro 
prostate cells or in vivo animal models for prostate cancer. 
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In one embodiment, a kit comprises: (a) a substrate comprising an 
adsorbent thereon, wherein the adsorbent is suitable for binding a marker, and (b) a 
washing solution or instructions for making a washing solution, wherein the combination 
of the adsorbent and the washing solution allows detection of the marker using gas phase 
ion spectrometry. Such kits can be prepared from the materials described above, and the 
previous discussion of these materials (e.g., probe substrates, adsorbents, washing 
solutions, etc.) is fully applicable to this section and need not be repeated. 

In some embodiments, the kit may comprise a first substrate comprising an 
adsorbent thereon (e.g., a particle functionalized with an adsorbent) and a second 
substrate onto which the first substrate can be positioned to form a probe which is 
removably insertable into a gas phase ion spectrometer. In other embodiments, the kit 
may comprise a single substrate which is in the form of a removably insertable probe with 
adsorbents on the substrate. 

Optionally, the kit may further comprise instructions for suitable 
operational parameters in the form of a label or a separate insert. For example, the kit 
may have standard instructions informing a consumer how to wash the probe after a 
sample of seminal plasma is contacted on the probe. 

In another embodiment, a kit comprises (a) an antibody that specifically 
binds to a marker; and (b) a detection reagent. Such kits can be prepared from the 
materials described above, and the previous discussion regarding the materials (e.g., 
antibodies, detection reagents, immobilized supports, etc.) is fully applicable to this 
section and need not be repeated. 

In either embodiment, the kit may optionally further comprise a standard 
or control information so that the test sample can be compared with the control 
information standard to determine if the test amount of a marker detected in a sample is a 
diagnostic amount consistent with a diagnosis of prostate cancer. 

EXAMPLES 

The following examples show the identification of markers that have 
positive or negative correlations with prostate cancer, prostate intraepithelial neoplasia, or 
benign prostate hyperplasia. Markers were determined from samples of cell lysates 
isolated by laser capture microdissection. 
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VI. Tissue Specimens 

Prostate tissues were procured from consenting patients undergoing radical 
prostatectomy, following approved protocols. The tissues were processed immediately 
and stored in the Virginia Prostate Center's Bio-repository. Tissue pieces harvested for 
5 LCM were immediately embedded in OCT and stored at -80°C. One cryosection was 
stained with H&E and examined by a pathologist to identify cells of interest. Mirrored 
H&E stained sections, fixed in formalin and paraffin embedded, were also used to further 
aid in the identification of specific cell types. Additional serial frozen sections at 8 
microns were used for microdissection. 

10 VII. Laser Capture Microdissection 

Populations of normal luminal epithelia, Benign Prostatic Hyperplasia 
(BPH), Prostate Intraepithelial Neoplasia (PIN), and Prostate Adenocarcinoma (PCA) 
epithelial cells were microdissected from frozen tissue sections using the PixCell™ II 
Laser Capture Microdissection Microscope (Arcturus Engineering, Inc., Mountain View, 

15 CA) essentially as described by Emmert-Buck et al, Science, 274: 998-1001 (1996). See 
also United States Patent 5,843,644 (Liotta et al) and United States Patent 5,843,657 
(Liotta et al). The procedure for staining frozen sections for LCM was followed with 
slight modifications: the hematoxylin step was omitted and protease inhibitors (Complete, 
Roche Biomedical, Indianapolis, IN) were added to the staining baths. A total of 1000 

20 laser shots totaling 3000-6000 cells were procured for each cell type. Matched benign 
and diseased epithelial cell types were harvested from each prostate sample. In some 
cases, adjacent stroma cells were also procured. Samples were standardized by total 
number of laser shots, and duplicate samples were captured from the same areas of each 
serial section to check reproducibility. 

25 

VIII. Laser Desorption/Ionization Mass Spectrometry 

Cell lysates were immediately prepared after microdissection by adding 4 
fil of the lysis buffer containing 20 mM HEPES pH 8.0 with 1% Triton X-100 directly on 
the LCM cap. Each lysate was diluted 1:10 in PBS buffer, giving a total volume of 40 ul. 
30 Lysates were vortexed for 1 0 minutes at 4°C and centrifuged briefly to remove cellular 
debris. The supernatant was added to an IMAC3 (Immobilized Metal Affinity Capture) 
ProteinChip® Array (Ciphergen Biosystems Inc. Fremont, CA), pretreated with 100 mM 
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CuS0 4 , following the manufacturer's instructions. A bioprocessor (Ciphergen 
Biosystems) was fitted on top of the chip arrays to permit the addition of the 40 ul 
sample. To control for variation, cell lysates harvested from each prostate tissue were 
analyzed on a single Biochip. The array was then incubated with the diluted lysate 
5 overnight with continuous shaking. After removal of the lysate, each spot was washed 
twice with PBS followed by a final water rinse. 



acid (Ciphergen Biosystems) in 50%(v/v) acetonitrile, 0.5%(v/v) trifluoroacetic acid was 
added to each spot. Time of flight mass spectra were generated in a Ciphergen Protein 
10 Biology System II (PBS II) by averaging 120 laser shots collected in the positive mode at 
laser settings of 225 and 280. Data was calibrated externally using purified peptide and 
protein standards. 



abundances for each peak were calculated as follows. The relative abundance of the 
15 proteins were subdivided into three classes: Low (+) equates to 1-30% of spectral scale, 
Medium (++) equates to 31-60%, and High (+++) equates to 60-100%. Numerical values 
were then assigned to the abundance levels (i.e. (+) equates to 33, (++) equates to 66, 
(+++) equates to 100) and averaged for each cell type for the purpose of representing the 
protein expression between prostate cell types in graphical form. 
20 Sensitivity is defined as the percent of diseased (BPH/PIN/PCA) cell types 

for which the biomarker of interest is present, (true positive/total number of diseased 
lysates x 100). Specificity is defined as the percent of cell types for which the biomarker 
of interest is not positive, (true negative/total number of lysates without disease x 100). 

IX. Markers in Cell Lysates from Prostate Epithelial Tissue 
25 A. IMAC3 ProteinChip 



The arrays were allowed to air dry and a saturated solution of sinapinic 



Spectra were analyzed with the Ciphergen Peaks 2.1 software and relative 



1. 



Protocol: 



1. 



Draw a hydrophobic ring around each adsorbent spot on the chip. 



2. 



Load lOul 100 mM CuSCm. Incubate chip for 10 minutes in a moist 
chamber with shaking. 

Wash Chip with DI H 2 Q 10 seconds, around spots and remove excess 



30 



3. 



water. 
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4. Load IOjj.1 100 mM sodium acetate, pH 4.0. Incubate chip for 10 minutes 
in a moist chamber with shaking. 

5. Wash Chip with DI H 2 0 and allow to air dry. 

6. Lyse microdissected cells in 4ul of 20 mM HEPES 1% Triton X-100 
directly on the cap. Pipette the lysis buffer up and down on top of the cells 
until lysis occurs. Dilute lysate 1:10 in PBS and vortex cell lysate 10 
minutes at 4°C and spin down. 

7. Spot the entire lysate (40ul) on spot and incubate in a moist chamber 
overnight with continuous shaking. 

8. Wash each spot 2x with 40ul of PBS. 

9. Wash lx 40ul with water and air dry. 

10. Spot 2x 0.5u.l sinapinic acid in 50% acetonitrile, .5% trifluoroacetic acid. 

11. Air dry. 

2. SELDI Reader Data Collection 

1 . Peaks were collected on the Ciphergen Biosystems PBSII at a laser 
intensity of 225 and 280 with a detector sensitivity of 10. 120 shots were 
collected and averaged. 

2. Data was calibrated externally using purified peptide and protein 
standards. 

B. Results 

Pure populations of matched benign (normal or BPH), PIN (high grade) 
and/or PCA epithelial cells were obtained from nine prostatectomy specimens examined. 
Stroma cells were also microdissected from a selected subset of tissues to compare to the 
epithelial cell profiles. A total of 80 cell lysates from an average of 5000 cells/protein 
array were analyzed by SELDI time-of flight mass spectrometry. Processing the lysates 
on an immobilized metal affinity capture surface pre-treated with CuS0 4 resolved an 
average of 70 protein or peptide peaks in the range from 3 kDa to 100 kDa. 

Figure 1 is a representative spectrum of the protein profile of a PCA cell 
lysate in the mass range of 3000-80,000 Da. The advantages of the SELDI technology in 
resolving small molecular weight protein or peptide species is evident, since the majority 
of peaks observed were below 20 kDa. Figure la shows an abundant set of peaks 
observed as a triplet, with the predominant peak at m/z of 3459. This cluster was 
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observed in most cell lysates analyzed, stromal and epithelial, and has previously been 
identified in our laboratory as defensin a- 1 from the urine of bladder cancer patients using 
a SELDI immunoassay (see e.g. Vlahou et al, Am. J. Pathol, 158: 1491-1502, (2001)). 
A peak at m/z of 28445 (Figure lc) is most likely free PSA. Serum albumin (HSA), a 
5 peak at m/z of 66866 was also present in all cell lysates tested (see Figure lc). 

Differential protein expression between cell types was evaluated by 
determining the abundance (i.e., over-expression or under-expression) of each peak 
relative to each matched cell type. Figure 2a is a an example of a group of peaks between 
4000 Da and 6000 Da showing differential expression between the epithelial cell types 

10 obtained from the same prostate tissue specimen. Three of these peaks (m/z 4030 (EP2), 
4358 (EP3), 4753 (EP6)) occur in greater frequency and abundance in the PIN and PCA 
lysates. As a general rule, peptide or protein species found in greater abundance in the 
PCA profiles, were also found in increasing amounts in the PIN profiles. Figure 2b 
shows the over-expression of the peak with an average m/z of 4749 (EP6) in the PIN and 

1 5 PCA lysates from three different prostatectomy specimens. Duplicate samples exhibited 
a high degree of reproducibility. In contrast, regions of heterogeneity were present in the 
protein profiles derived from two different foci of PCA in Patient #2 and two foci of PIN 
in Patient #3. Profile differences were also found in the BPH cell lysates. Figure 2c is an 
example of a peak at m/z of 5666 (EP8), which appears in greater abundance in the BPH 

20 epithelial cells when compared to the spectra of the other matched cell types. With few 
exceptions, the protein patterns found in BPH cell lysates more closely resembled the 
normal epithelial profile. 

The profiles of proteins greater than 10 kDa also demonstrated some 
differential expression patterns. One peak with an average m/z 28442 was found in lower 

25 abundance in the BPH, PIN and PCA epithelial cells compared to profiles of matched 
normal cell lysates. This mass is consistent with the molecular weight of free PSA 
(Belanger et al. Prostate 27: 187-197 (1995)). An example of this decrease in 
intracellular PSA levels for PIN and PCA cells can be seen in Figure 3. A large PSA 
peak at 28393 Da was present in the normal prostate epithelia but absent from adjacent 

30 stroma cells. The PIN and PCA cells have a greatly reduced PSA peak. Normal 

epithelial cells procured directly adjacent to the PCA foci also exhibited a large amount of 
intracellular PSA. The 66838 Da peak is albumin and was present in all cell lysates at 
high abundance. 
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These comparative analyses suggested that several proteins showed 
differential expression between the four cell types. However, to more precisely select 
those with high biomarker potential, it was necessary to standardize peak expression 
levels throughout the matched populations. The relative abundance of the proteins were 
5 subdivided into three classes: Low (+) equates to 1-33% of spectral scale, Medium (++) 
equates to 33-66%, and High (+++) equates to 66-100%. Each individual spectrum was 
expanded in specific mass ranges to assist in determining the relative abundance of each 
observed peak. An in-house computer program was used to cluster the peaks to obtain an 
average mass for each peak with a mass error range of 0.15%. From an average of 70 

1 0 peaks observed, 14 (20%) displayed expression differences between the epithelial cell 
profiles (Table 1). The scores in Table 1 were converted to numerical values and plotted 
in a histogram showing the average differences in expression levels between cell types 
subtracted from the expression level in normal epithelia (Figure 4). The over-expression 
and under-expression for these 14 protein/peptides was "normalized" and evaluated for 

1 5 BPH, PIN and PCA cells. Generally, the PCA and PIN levels of expression for protein or 
peptides were similar. As shown in Figure 4, there were higher levels of a group of small 
molecular weight peptides/proteins between 3000 and 6000 Da in the PIN and PCA 
profiles. The first of these, 3448 Da, was found increased in BPH, PIN, and PCA cell 
lysates. Three peaks (4036 Da (EP2), 4361 Da (EP3), and 4749 Da (EP6)) were over- 

20 expressed in PIN and PCA with the highest level of over-expression in PCA lysates. 

Note that none of the BPH cell lysates exhibited the 4036 Da (EP2) peak. A summary of 
the percent of each cell type exhibiting those most significantly increased or decreased 
proteins is presented in Table 2. 
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Table 1: Comparison of the abundance levels* of proteins in lysates of LCM procured 
prostate cells (epithelial and stromal). *Relative abundance of the proteins: + = 1-33% of 
spectral scale, ++ = 33-66%, +++ = 66-100%, 0 = peak not detected. 
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. Two peaks (4639 Da (EPS) and 24184 Da (EP12)) were specific to PCA 
lysates in 43 % of the PCA samples tested. The peak at 4749 Da (EP6) was over- 
expressed in 67% and 57% of PIN and PCA lysates, respectively, with the highest overall 
peak expression observed in the PCA lysates. A 14696 Da (EP1 1) peak was present in 
56% of the PIN cell lysates. Also of note, was a large increase in expression of a peak at 
5666 Da (EP8) found in 86% of the BPH profiles. A peak at m/z 4413 (EP4) and two 
high molecular weight proteins (48208 Da (EP13) and 53836 Da (EP14)) were also found 

41 



1/23/2009, EAST Version: 2.3.0.3 



WO 03/072710 PCT/US03/05042 

to have increased expression in BPH cells. As shown in Figure 4, a few proteins were 
under-expressed in BPH, PIN, and PCA cells where compared to the expression levels in 
normal epithelial cells. Expression of the 5666 Da (EP8) peak, which was over-expressed 
in BPH profile, was reduced in 22% of PIN ad 71% of PCA cell lysates at significant 
5 levels (Table 2). In addition, decreased expression of an 1 1 744 Da protein (EP10) and 
PSA (28422 Da) was found in 56 % and 57% of the PIN and PCA cell lysates. The 
reduced expression of intracellular PSA was found in 56% and 57% of PIN and PCA cell 
lysates, respectively. 

10 Table 2: Percent of prostate cells lysates (BPH/PIN/PCA) showing differential expression 
of selected proteins (i.e. greater or lesser abundance) when compared to the expression in 
matched normal prostate epithelial cell lysates. 

BPH PIN PCA 

15 % of sample % of sample % of sample 
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The significance of the expression differences between cell types was 
evaluated using the Wilcoxon signed rank test (Table 3). Expression differences were 

20 found to be significant (p<.05) for BPH vs. PIN and BPH vs. PCA for over-expression of 
the 4036 Da (EP2) protein (Table 3). A comparison of N vs. PCA approaches 
significance (p=.059) for this peak. Similarly, significant differences were observed in N 
vs. PCA, and BPH vs. PIN in over-expression of the 4361 (EP3) peak. Over-expression 
of the 4749 Da (EP6) peak was significant for N vs. PIN and approaches significance for 

25 BPH vs. PIN (Table 3). The 5666 Da (EP8) protein was significantly over-expressed in 
N vs. BPH and, interestingly, is significantly under-expressed in BPH vs. PIN and BPH 
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vs. PCA. The under-expression of PSA was significant in N vs. PIN and approaches 
significance (p=.066) for N vs. PCA. 



Table 3: Wilcoxon signed ranks test of peak expression differences. 



Peak (Da) 


N vs. BPH 


N vs. PIN 


N vs. PCA 


BPH vs. 


BPH vs. 


PIN vs. 










PIN 


PCA 


PCA 


4036 


0.317 


0.083 


0.059 


0.025' 


0.039' 


0.102 


4361 


0.564 


0.059 


0.038' 


0.046' 


0.102 


0.083 


4413 


0.083 
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0.317 


0.655 


1.000 


0.317 


4639 


1.000 


1.000 


0.083 


1.000 


0.157 


0.083 


4749 


0.317 


0.034 1 
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0.059 


0.109 


0.564 


5666 


0.038 1 


1.000 


0.059 


0.024 2 


0.039 2 


0.408 


28422 


0.083 


0.041 2 


0.066 


0.680 


0.180 


0.564 


(PSA) 















' significant p-values ( ^0.05), based on over-expression as compared to N or BPH 
2 significant p-values ( <0.05), based on under-expression as compared to N or BPH 



Since no single peptide or protein was discovered to be present, or 
differentially expressed in all the PIN or PCA profiles, various combinations of selected 
proteins were evaluated to identify a panel of markers (expression levels) that could 
improve disease classification of each specific cell type. A biomarker combination was 

1 0 classified as positive if any marker in the combination was present in a sample, and 
negative if none of the markers were detected in a specimen. Since these markers were 
over-expressed in both PCA and PIN, the combination of the markers did not improve the 
specificity for each diseased cell type individually. Better discrimination was achieved 
by combining the benign cell types (N, BPH) and comparing them to diseased cell types 

1 5 (PIN, PCA). The sensitivity and specificity of each marker for benign versus disease was 
then calculated, both individually and in additive combinations. As shown in Table 4, by 
combining markers EP2 (m/z 4036) and EP5 (m/z 4639), there was an improvement in 
the sensitivity for PCA from 71% to 86%, while maintaining a specificity of 100% for 
PIN or PCA combined. This combination of markers also identified 44% of the PIN 

20 samples. Combining marker EP3 (m/z 4361) and marker EP5 (m/z 4639) increased the 
sensitivity for PCA to 100%, while the specificity decreased to 87% for PIN and PCA. 
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By combining markers EP3 and EP6 (m/z 4749), 100% of the PCA and 89% of the PIN 
samples could be correctly identified, with a slight decrease in specificity. Combinations 
involving three or more of the markers did not improve the overall specificity and 
sensitivity. The 5666 Da protein, (marker EP8, Table 4) had a sensitivity of 86% and 
5 specificity of 88% for detecting BPH. This marker also provided some discrimination 
between BPH and PIN lysates having 22% sensitivity and 68% specificity for PIN 
epithelia. 

Table 4: Sensitivity and specificity of single markers and combinations of markers for the 
1 0 detection of abnormal (diseased) cells. *The increase in abundance of each marker (EP2- 
EP6) found in PIN and PCA epithelial cells was determined from matched normal or 



BPH epithelial cell lysates. ** The sensitivity and specificity of a marker (EP8) found in 
abundance in BPH samples when compared to PIN and PCA samples. 







Sensitivity 


Specificity 


Specificity 


Marker 


Expression* 


PCA 


PIN 


PCA 


PIN 


PIN or PCA 


EP2 


increase of m/z 4036 


71% 


44% 


83% 


77% 


100% 


EP3 


increase of m/z of 4361 


71% 


44% 


75% 


68% 


87% 


EP5 


presence of m/z 4639 


43% 


0% 


100% 




100% 


EP6 


increase of m/z of 4749 


57% 


67% 


71% 


71% 


93% 


Combination 


EP2 + EP3 


86% 


67% 


67% 


64% 


87% 




EP2 + EP5 


86% 


44% 


83% 


77% 


100% 




EP3 + EP5 


100% 


56% 


75% 


68% 


87% 




EP2 + EP6 


86% 


78% 


67% 


68% 


93% 




EP3 + EP6 


100% 


89% 


54% 


55% 


80% 




EP5 + EP6 


86% 


67% 


71% 


68% 


93% 




EP2 + EP3 + EPS 


86% 


67% 


67% 


64% 


93% 




EP2 + EP3 + EP6 


100% 


89% 


54% 


59% 


80% 






BPH 


PIN 


BPH 


PIN 




EP8 ** 


increase of m/z 5666 


86% 


22% 


88% 68% 





1 5 Multivariable analysis of the peak expression data was evaluated to 

determine if the simultaneous over-expression or under-expression of several proteins in 
combination could be used to predict benign versus diseased cell lysates. Logistic 
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regression analysis was performed with the seven most significant differentially 
expressed peaks (4036 Da, 4413 Da, 4639 Da, 4729 Da, 5666 Da, and 28422 (PSA)). 
Table 5 shows a predictive equation based on diseased (PIN/PCA) versus benign 
(N/BPH), resulted in 93.3% specificity and 93.8% sensitivity for PIN or PCA cell lysates. 

5 

Table 5: Classification table based on logistic regression analysis using expression levels 
of seven proteins 4036 Da (EP2), 4361 Da (EP3), 4413 Da (EP4), 4639 (EP5), 4749 Da 
EP6), 5666 Da (EP8), 28422 Da (PSA). 
10 







predicted 








PIN/PCA 


% 




observed 


N 


Y 


correct 


PIN/PCA 


N 


15 


14 


1 


93.3 (specificity) 




Y 


16 


1 


15 


93.8 (sensitivity) 












93.5 (overall) 



The development of cancer is a multistep process involving many events 
in the expression of oncogenes and tumor suppressor genes. Single changes in gene 
expression can lead to the differential expression of a myriad of proteins. Analyzing the 

1 5 multiple protein changes that occur in prostate disease progression is a difficult task, 
made even more daunting by the biological heterogeneity of the disease. Advances in 
time of flight mass spectroscopy, resulting in the SELDI technology, have lead to a 
method for the sensitive and direct analysis and profiling of proteins in complex 
biological samples. Previous studies in this and other laboratories have profiled a limited 

20 number of prostate cell specimens (see e.g. Paweletz, et al., Drug Development Research 
49: 34-42 (2001), Wright, et al, Prostate Cancer and Prostate Diseases 2: 264-267 
(2000), and Paweletz, et al., Urology 57: 160-163 (2001)). In this study we combined 
SELDI with LCM to produce protein profiles from 40 prostate cell lysates derived from 
nine prostatectomy specimens. Our results demonstrate that the protein profiles of 

25 normal prostate epithelia, BPH, PIN and PCA display discriminating differences. 

Protein extracts prepared from prostate cell types were analyzed using an 
IMAC3 protein biochip pretreated with Q1SO4. An average of 70 protein peaks were 
detected from the cell lysates using this surface. Overall there were a relatively large 
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number of common peaks in the benign and diseased epithelial cell profiles, and very few 
peaks that would, based on presence or absence, be candidate biomarkers for the disease 
progression and/or diagnosis of prostate cancer. It was therefore determined that 
calculating abundance levels of peaks would enhance the significance of the results and 
5 identify possible expression differences between the samples. Expression levels were 
calculated for diseased cell profiles as compared to expression levels found in matched 
benign cell types. Out of the 70 peaks observed, 14 of these (20%) displayed 
deregulation in the diseased profiles. 

Several small molecular weight peptides or proteins (3000 to 6000 Da) had 

1 0 increased expression levels in the PIN and PCA cell extracts. The cluster of peaks in this 
range could have originated from proteolysis, or may be cleavage products of larger 
proteins, but were consistent enough in common cell types to be considered part of the 
general profile. However, even cleavage products of proteins may be indicative of 
changes occurring in the prostate disease cycle, as demonstrated in a study of 

15 semenogelin, from seminal plasma {see Merchant, et al, Electrophoresis, 21: 1 164-1 177 

(2000) ). Furthermore, since the IMAC surface has the ability to bind phosphorylated 
peptides or proteins it is feasible that some of the changes observed are due to differential 
phosphorylation of the proteins or peptides in the diseased cell types. Two peaks 
identified in this study with an average mass 4827 Da ± 26.5 (EP7) (common in the 

20 benign cell types) and 4749 Da ± 26.1 (EP6) (overexpressed in PIN/PCA, see Table 1) 
may represent a de-phosphorylation event occurring in the PIN and PCA cell lysates. The 
average mass shift between these proteins is 78 Da, close to the calculated mass shift of 
79 Da for a 5' de-phosphorylation. Phosphorylation is one of the most important post- 
translational modifications of proteins and enables regulation of many physiological cell 

25 processes (transport, proliferation, differentiation). Examples of aberrant phosphorylation 
of proteins found in cancer studies include ERK 1/ 2 in breast cancer {see Gee, et al, Int. 
J. Cancer 95: 247-254 (2001)), and androgen receptor in prostate cancer {see Wang, et 
al, Biochem. Biophys. Res. Commun., 259: 21-28 (1999). 

Alternatively, these small molecules could themselves be intact functional 

30 proteins or peptides, examples of which include pro-hormones, growth factors, amidated 
peptides, and defensins. In a recent study by Rocchi et al., Cancer Res., 61:1 196-1206 

(2001) PC-3 and Dul45 prostate cancer cell lines were found to produce and secrete a 
multifunctional amidated peptide. Immunohistochemical staining was also performed on 
human prostate tissue and showed increased levels of adrenomedullin staining in 

46 



1/23/2009, EAST Version: 2.3.0.3 



WO 03/072710 PCT/US03/05042 

cancerous epithelia. The authors further demonstrated the activity of the enzyme 
peptidylglycine alpha-amidatin monooxygenase (PAM) in prostate cancer cell lines, 
which produces alpha-amidated bioactive peptides from their inactive glycine-extended 
precursors. C-Terminal amidation is common in peptide hormones, and these peptides 
5 are produced in neuroendocrine cells, known to be present in prostate carcinoma, and 
function as growth factors. 

Previous studies have shown a cytogenetic link between high grade PIN 
and Prostate Cancer strengthening its role as a precursor lesion {see Alcaraz, et al, 
Prostate, 47: 29-35 (2001)). In addition, greater than 50% of patients with high grade 

10 PIN have cancer detected on a subsequent biopsy (see Park, et al, J. Urol. 165: 1409- 

14140 (2001). Therefore, the identification of proteins associated with PIN may be useful 
as markers of early detection. In this study PIN and PCA cell lysates exhibited similar 
protein profiles. Three peaks at 4036 Da (EP2), 4361 Da (EP3), and 4749 Da (EP6), 
showed increased abundance in PIN and PCA lesions when compared to matched benign 

1 5 cell profiles. Two other peaks at 1 1744 (EP10), and 28442 Da (free PSA) had decreased 
abundance in BPH, PIN and PCA cell extracts, and one marker at 14696 Da (EP1 1) was 
over-expressed in 56% of the PIN samples and only 29% of matched PCA samples. 
Interestingly, a few of these peaks, based on closely matched molecular weights, were 
found to be present in a matched seminal plasma from one of the patients donating tissue 

20 for this study (data not shown). Since these seminal plasma profiles were generated using 
the MAC surface pre-treated with CuS0 4 , they may represent the same protein, however, 
additional matched seminal plasma samples will need to be examined to confirm this 
result. The identification of these peptides or proteins and their use as possible markers 
of early detection is currently under investigation in our laboratory. 

25 The independent evolution of PIN and tumor foci is a common feature of 

primary tumors of the prostate, making a panel of biomarkers the most likely solution to 
improvements in early detection and diagnosis. A combination of biomarkers with the 
most significant differential expression was therefore investigated. Since most of the 
markers found in the PCA profiles were also present in the PIN profiles the ability to 

30 discriminate these two cell types from each other was difficult; whereas, better 

discrimination could be achieved between the benign (N, BPH) and the diseased cell 
types (PIN or PCA combined). However, a combination of markers 4361 Da (EP3), and 
4749 Da (EP6) increased the sensitivity to 100% for detection of PIN and PCA with a 
only a slight decrease in specificity. Also of note, were two peaks (4639 Da (EP5)and 
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241 84 Da (EP12)) found only in PC A cell lysates. Both of these markers were expressed 
in 43% of tumor samples. No correlation of biomarker expression with grade of tumor 
was observed. Further studies using different capture surfaces for SELDI profiling may 
reveal additional PCA biomarkers. Protein profiles of metastatic prostate samples will be 
5 required to determine if any of the selected "biomarkers" represent markers of metastatic 
potential. 

SELDI analyses performed on cell lysates of surrounding adjacent normal 
epithelia as well as stroma was performed to determine if possible field effects accounted 
for the differences in the protein profiles. These analyses revealed interesting information 

1 0 on the milieu of proteins present in the diseased prostate cells. Defensins a-1 and a-2 
were present in both epithelial and stroma cells with increased amounts in BPH, PIN and 
PCA cell lysates and surrounding stroma. Defensins are a family of small peptides with 
antimicrobial, cytotoxic, and antitumor activities (see Lichtenstein, et al, J. Immunol. 
140: 2686-2694 (1988). Two types of defensins have been characterized. jS-defensins are 

1 5 primarily expressed by epithelial cells of the kidneys, skin, and respiratory tract, (see 
Yang, et al, Science, 286: 525-528 (1999), and Zhao, et al, FEBSLett., 396: 319-322 
(1996)), whereas, odefensins are expressed by neutrophils and Paneth cells (see Ayabe, 
et al, Nat. Immunol, 1:113-118 (2000)). Recently o>defensins have been found in the 
submandibular glands of patients with oral carcinoma (see Mizukawa, et al, Anticancer 

20 Res., 20: 1 125-1 127, (2000), and Mizukawa, et al, Anticancer Res., 19: 2969-2971 
(1999)), and, a study found c<-defensin peptides in the urine of bladder cancer patients 
(see Vlahou, et al, Am. J. Pathol, 158: 1491-1502 (2001)). It is possible that neutrophils 
in the surrounding area are secreting these defensins into prostate tissue in response to an 
inflammatory or disease state. Unfortunately no tissue from a normal non-diseased 

25 prostate was available to use as a control to test this hypothesis. Another peak at 4361 Da 
(EP3), found to be over-expressed in the PIN and PCA cell extracts, was also present at 
higher levels in stroma adjacent to the diseased cells, suggesting a possible field effect for 
this peptide. 

Differences were observed for the expression of a 28,400 Da peak, which 
30 is presumed to be intracellular free PSA The molecular weight identified in our study 
closely matches the observed molecular mass of 28,430 Da for free PSA determined 
using ion spray mass spectroscopy (ISMS) (see Belanger, et al, Prostate, 27: 187-197 
(1995). This peak was also absent from matched stroma cell lysates, indicating its 
specific expression in epithelial cells. Furthermore, data obtained from an immunoassay, 
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also performed using the SELDI platform, Wright, et al, Prostate Cancer and Prostate 
Diseases 2: 264-267 (2000), has identified this peak from LCM lysates as PSA based on 
immunoaffinity (data not shown). In this study, normal epithelia of the prostate 
expressed large amount of PSA whereas the diseased cell types (PIN and PCA) had 
5 reduced expression levels. As expected, PSA was absent from stroma cells. Normal 
epithelia microdissected directly adjacent to the tumor foci also expressed high levels of 
PSA. This decrease in intracellular PSA in PCA cells is in agreement with other studies. 
For example, Stege et al, Prostate, 38: 183-188 (1999) found tissue PSA levels lower in 
cancerous than in normal tissue, and a study by Weir et al, J. Urol, 163: 1739-1742, 

1 0 (2000) found immunohistochemical staining intensity of PSA inversely correlated with 
histological grade of the tumor. Interestingly a recent report by Pawletz et al, Drug 
Development Research 49: 34-42 (2001), also found a 28 kDa protein peak (not identified 
as PSA) via SELDI to be down-regulated in microdissected PCA cells when compared to 
matched normal epithelia. Our result confirms the current opinion that the increase in 

1 5 serum PSA in men with prostate cancer is not due to increased production of PSA by the 
tumor cells, but an increased leakage or secretion of PSA into the circulation by the 
surrounding normal epithelium. 

The BPH protein profiles also displayed some notable differences. There 
was an increase in defensin a-1 (3448 Da, EP1) in the BPH lesions, as well as an increase 

20 in abundance of peaks at 4413 Da (EP4), and 5666 Da (EP8). Of special interest was the 
over-expression of a peak at 5666 Da (EP8) found in greater abundance in 86% of the 
BPH cell lysates tested with a specificity of 88%. Only 22% of the PIN lesions, and none 
of the PCA lesions overexpressed this marker. Efforts are underway to characterize this 
marker. A protein biomarker indicative of BPH alone may, if secreted into serum or 

25 seminal plasma, be useful in the prevention of unnecessary biopsies in patients with 

elevated PSA. Two other high molecular weight proteins at 48308 Da (EP13) and 53830 
Da (EP14) were also found at increased abundance in BPH cell lysates. 

The present invention provides novel materials and methods for aiding 
30 prostate cancer diagnosis using markers that are differentially present in samples pure 
matched benign (normal or BPH), PIN (high grade), and/or PCA cells. While specific 
examples have been provided, the above description is illustrative and not restrictive. 
Any one or more of the features of the previously described embodiments can be 
combined in any manner with one or more features of any other embodiments in the 
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present invention. Furthermore, many variations of the invention will become apparent to 
those skilled in the art upon review of the specification. The scope of the invention 
should, therefore, be determined not with reference to the above description, but instead 
should be determined with reference to the appended claims along with their full scope of 
5 equivalents. 

All publications and patent documents cited in this application are incorporated by 
reference in their entirety for all purposes to the same extent as if each individual 
publication or patent document were so individually denoted. By their citation of various 
references in this document, Applicants do not admit any particular reference is "prior 
1 0 art" to their invention. 
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WHAT IS CLAIMED IS: 


1 




1 . A method of qualifying a prostate cancer status in a subject 


2 


comprising: 




3 




(a) measuring at least one biomarker in a sample from the subject, 


4 


wherein the biomarker is selected from the group consisting of: 


5 




Marker EP1: 3448 ± 19 Da, 


6 




Marker EP2: 4036 ± 22 Da, 


7 




Marker EP3: 4361 ±24 Da, 


8 




Marker EP4: 4413 ± 24 Da, 


9 




Marker EP5: 4639 ± 26 Da, 


10 




Marker EP6: 4749 ±26 Da, 


11 




Marker EP7:4827± 27 Da, 


12 




Marker arts, oooo ± 5 1 Da, 


13 




Marker EP9: 8445 ±46 Da, 


14 




Marker EP10: 11744 ±65 Da, 


15 




Marker EP1 1 : 14696 ± 81 Da, 


16 




Marker EP12: 24184 ± 133 Da, 


17 




Marker EP13: 48308 ± 266 Da, 


18 




Marker EP14: 53830 ± 296 Da; and combinations thereof; 


19 


and, 




20 




(b) correlating the measurement with prostate cancer status. 


1 


2. 


The method of claim 1, wherein the prostate cancer status is 


2 


selected from the group consisting of prostate cancer (PCA), prostate intraepithelial 


3 


neoplasia (PIN), and 


benign prostate hyperplasia (BPH). 


1 


3. 


The method of claim 1 , wherein the sample is prostate tissue 


2 


extract. 






4. 


The method of claim 1, wherein measuring comprises determining 


2 


the mass of the protein. 


1 


5. 


The method of claim 4, wherein the mass of the protein is 


2 


determined by mass spectrometry. 
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1 6. The method ofclaiml, wherein the sample is selected from the 

2 group consisting of blood, serum, urine, prostatic fluid, seminal fluid, semen, and prostate 

3 tissue. 

1 7. The method of claim 5, wherein mass spectrometry is gas phase ion 

2 spectrometry. 

1 8. The method of claim 7, wherein gas phase ion spectrometry is laser 

2 desorption ionization mass spectrometry. 

1 9. The method of claim 1, wherein step a) further comprises detecting 

2 the marker by immunoassay. 

1 10. The method of claim 1, wherein 

2 step a) further comprises: 

3 i) fractionating the sample; 

4 ii) binding a fraction of the sample to an adsorbent; and, 

5 iii) comprises detecting the marker by gas phase ion spectrometry. 

1 11. The method of claim 10, wherein the adsorbent is selected from the 

2 group consisting of a hydrophilic adsorbent, a metal chelate adsorbent, and a strong anion 

3 exchange adsorbent. 

1 12. The method of claim 1 , wherein 

2 step a) comprises: 

3 i) embedding a portion of a tissue specimen harvested from a 

4 patient in OCT and freezing the specimen; 

5 ii) obtaining cryosections from the tissue specimen; 

6 iii) obtaining cell samples from the cryosections by laser capture 

7 microdissection; 

8 iv) mixing cell samples from step (d) with lysis buffer thereby 

9 producing cell lysates; 

10 v) diluting and vortexing the cell lysates, 

1 1 vi) centrifuging the vortexed cell lysates thereby producing a 

12 supernatant fraction; 
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13 



14 



vii) binding the supernatant fraction to an adsorbent; and, 

viii) comprising detecting the marker using gas phase ion 



spectrometry. 



13. The method of claim 12, wherein the adsorbent of step a) is 
selected from the group consisting of a hydrophilic adsorbent, a metal chelate adsorbent, 
and a strong anion exchange adsorbent. 



15. The method of claim 1, wherein the sample comprises the markers 

EP2andEP3. 

16. The method of claim 1 , wherein the sample comprises the markers 

EP2 and EP5. 

17. The method of claim 1 , wherein the sample comprises the markers 

EP3 and EP5. 

1 8 . The method of claim 1 , wherein the sample comprises the markers 

EP2 and EP6. 



2 EP3 and EP6. 

1 20. The method of claim 1 , wherein the sample comprises the markers 

2 EP5 and EP6. 

1 21. The method of claim 1 , wherein the sample comprises the markers 

2 EP2, EP3 and EP5. 

1 22. The method of claim 1 , wherein the sample comprises the markers 

2 EP2, EP3 and EP6. 

1 23 . The method of claim 1 , wherein the sample comprises the markers 

2 EP2, EP3, EP4, EP5, EP6, and EP8. 



14. 



The method of claim 1, wherein the sample comprises the marker 



EP8. 



19. 



The method of claim 1, wherein the sample comprises the markers 
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1 24. The method of claim 8, wherein 

2 step b) comprises: 

3 i) generating data for each marker with the mass 

4 spectrometer, the data comprising a mass/charge ratio and an amount determination for 

5 each ion corresponding to each marker; 

6 ii) transforming the data into computer-readable form; and, 

7 iii) executing an algorithm with a programmable digital 

8 computer, 

9 wherein the algorithm determines closeness-of-fit between the computer- 

1 0 readable data and a data set indicating a diagnosis of PCA, PIN, BPH or a negative 

1 1 diagnosis. 

1 25 . A method for detecting at least one marker in a sample, the method 

2 comprising: 

3 a) obtaining a sample comprising at least one marker, where each 

4 marker is selected from the group consisting of: 

5 Marker EP1: 3448 ± 19 Da, 

6 Marker EP2: 4036 ±22 Da, 

7 Marker EP3: 4361 ± 24 Da, 

8 Marker EP4: 441 3 ± 24 Da, 

9 Marker EP5: 4639 ± 26 Da, 

1 0 Marker EP6: 4749 ± 26 Da, 

1 1 Marker EP7: 4827± 27 Da, 

1 2 Marker EP8: 5666 ± 3 1 Da, 

1 3 Marker EP9: 8445 ± 46 Da, 

14 Marker EP10: 1 1744 ± 65 Da, 

1 5 Marker EP 1 1 : 14696 ± 8 1 Da, 

16 Marker EP12: 24184 ± 133 Da, 

17 Marker EP13: 48308 ±266 Da, and 

18 Marker EP14: 53830 ± 296 Da; and, 

19 b) detecting the marker by gas phase ion spectrometry. 

1 26. The method of claim 25, wherein gas phase ion spectrometry is 

2 laser desorption/ionization mass spectrometry. 
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1 27. The method of claim 22, wherein the sample comprises at least two 

2 markers wherein each marker is differentially present in the sample. 

1 28. The method of claim 26, further comprising: 

2 c) generating data for each marker with the mass spectrometer, the 

3 data comprising a mass/charge ratio and an amount determination for each ion 

4 corresponding to each marker, 

5 d) transforming the data into computer-readable form; and 

6 e) executing an algorithm with a programmable digital computer 

7 wherein the algorithm detects the amount determination in the computer- 

8 readable data representing the marker and determines closeness-of-fit between the 

9 computer-readable data and a data set indicating a diagnosis of PC A, PIN, BPH or a 
10 negative diagnosis. 

1 29. The method of claim 29, wherein the algorithm comprises an 

2 artificial intelligence program. 

1 30. The method of claim 29, wherein the artificial intelligence program 

2 is a fuzzy logic, cluster analysis or neural network. 

1 31. The method of claim 25, wherein step a) further comprises: 

2 fractionating the sample by size exclusion chromatography or anion exchange 

3 chromatography, and collecting a fraction that includes the marker or markers. 

1 32. The method of claim 25, wherein step a) further comprises 

2 contacting the sample with a substrate comprising an adsorbent that retains the marker 

3 and removing unretained sample. 

1 33. The method of claim 29, wherein the substrate is a mass 

2 spectrometer probe comprising the adsorbent on a probe surface. 

1 34. The method of claim 29, wherein the substrate is a resin, and step 

2 a) further comprises placing the resin with the marker retained by the adsorbent on a mass 

3 spectrometer probe. 
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1 35. The method of claim 29, wherein the adsorbent is selected from the 

2 group consisting of a hydrophilic adsorbent, a strong anion exchange adsorbent and a 

3 metal chelate adsorbent. 

1 36. The method of claim 26, wherein 

2 step a) further comprises: 

3 i) providing a probe adapted for use with a mass 

4 spectrometer comprising an adsorbent attached thereto; 

5 ii) contacting the sample comprising the marker with the 

6 adsorbent. 

1 37. The method of claim 26, wherein 

2 step a) further comprises: 

3 i) providing a substrate comprising an adsorbent attached 

4 thereto; 

5 ii) contacting the sample comprising the protein with the 

6 adsorbent; 

7 iii) placing the substrate on a probe adapted for use with a 

8 mass spectrometer. 

1 38. The method of claim 36, wherein the adsorbent is a hydrophilic 

2 adsorbent or a metal chelate adsorbent. 

1 39. The method of claim 37, wherein the adsorbent is a hydrophilic 

2 adsorbent comprising silicon oxide. 

1 40. The method of claim 37, wherein the adsorbent is a metal chelate 

2 adsorbent comprising copper. 

1 41 . The method of claim 37, wherein the adsorbent comprises an 

2 antibody that specifically binds to the marker. 

1 42. A purified marker selected from the group consisting of: 

2 Marker EP1: 3448 ± 19 Da, 

3 Marker EP2: 4036 ±22 Da, 

4 Marker EP3: 436 1 ± 24 Da, 
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5 Marker EP4: 4413 ± 24 Da, 

6 Marker EP5 : 4639 ± 26 Da, 

7 Marker EP6: 4749 ±26 Da, 

8 Marker EP7:4827± 27 Da, 

9 Marker EP8: 5666 ±31 Da, 

1 0 Marker EP9: 8445 ± 46 Da, 

1 1 Marker EP 1 0: 1 1 744 ± 65 Da, 

12 Marker EP1 1 : 14696 ± 81 Da, 

13 Marker EP12: 241 84 ± 133 Da, 

14 Marker EP13: 48308 ±266 Da, and 

15 Marker EP14: 53830 ± 296 Da. 

1 43 . The purified protein of claim 42, produced by a process 

2 comprising: 

3 i) fractionating a sample comprising the marker or markers by size 

4 exclusion chromatography or anion exchange chromatography; and, 

5 ii) collecting a fraction that includes the marker or markers. 

1 44. The purified protein of claim 42, produced by a process 

2 comprising: 

3 i) microdissecting a cell sample comprising the marker by laser 

4 capture microdissection, thereby producing isolated cells, 

5 ii) lysing the isolated cells producing a cell lysate, 

6 iii) centrifuging the cell lysate, thereby producing a cell supernatant 

7 comprising the marker, 

8 iv) contacting the cell supernatant to an adsorbent sufficient to 

9 allow the adsorbent to bind the marker; 

1 0 v) washing the adsorbent to remove unbound cell supernatant; and, 

1 1 vi) eluting the marker from the adsorbent. 

1 45. A kit comprising: 

2 (1) an adsorbent attached to a substrate, wherein the adsorbent is 

3 suitable for retaining a marker selected from the group consisting of: 

4 Marker EP1 : 3448 ± 19 Da, 

5 Marker EP2: 4036 ±22 Da, 
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6 Marker EP3: 4361 ± 24 Da, 

7 Marker EP4: 441 3 ± 24 Da, 

8 Marker EP5 : 4639 ± 26 Da, 

9 Marker EP6: 4749 ± 26 Da, 

1 0 Marker EP7: 4827± 27 Da, 

11 MarkerEP8:5666±31 Da, 

1 2 Marker EP9: 8445 ± 46 Da, 

13 Marker EP10: 11744 ±65 Da, 

14 MarkerEPll:14696±81Da, 

15 Marker EP12: 24184 ± 133 Da, 

16 Marker EP13: 48308 ±266 Da, and 

17 Marker EP14: 53830 ± 296 Da; and 

18 (2) instructions for using the substrate to detect the marker. 
19 

1 46. The kit of claim 45, wherein the instructions include methods for 



2 contacting a sample comprising the marker with the adsorbent. 
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Figure 1. 
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Figure 2b. 
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